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Abstract. Fuchsian groups with a modular embedding have the richest arith¬ 
metic properties among non-arithmetic Fuchsian groups. But they are very 
rare, all known examples being related either to triangle groups or to Teich- 
miiller curves. 

In Part I of this paper we study the arithmetic properties of the modu¬ 
lar embedding and develop from scratch a theory of twisted modular forms 
for Fuchsian groups with a modular embedding, proving dimension formulas, 
coefficient growth estimates and differential equations. 

In Part II we provide a modular proof for an Apery-like integrality state¬ 
ment for solutions of Picard-Fuchs equations. We illustrate the theory on a 
worked example, giving explicit Fourier expansions of twisted modular forms 
and the equation of a Teichmiiller curve in a Hilbert modular surface. 

In Part III we show that genus two Teichmiiller curves are cut out in Hilbert 
modular surfaces by a product of theta derivatives. We rederive most of the 
known properties of those Teichmiiller curves from this viewpoint, without 
using the theory of flat surfaces. As a consequence we give the modular em¬ 
beddings for all genus two Teichmiiller curves and prove that the Fourier devel¬ 
opments of their twisted modular forms are algebraic up to one transcenden¬ 
tal scaling constant. Moreover, we prove that Bainbridge’s compactiflcation 
of Hilbert modular surfaces is toroidal. The strategy to compactify can be 
expressed using continued fractions and resembles Hirzebruch’s in form, but 
every detail is different. 


Introduction 

Modular forms are certainly best understood for the full modular group SL(2, Z), 
closely followed by those for its congruence subgroups and other arithmetic groups. 
Among the non-arithmetic Fuchsian groups, the groups having the best arithmetic 
properties are those admitting a modular embedding. Here, modular embedding 
refers to the existence of a map : H —>■ H intertwining the action of a Fuchsian 
group F and its Galois conjugate. The notion of modular embedding (in this sense) 
appears for the first time in work of Cohen and Wolfart ([10]). They show that 
triangle groups admit modular embeddings, and for more than a decade these 
remained the only examples. An infinite collection of new examples were found 
with the discovery of new Teichmiiller curves by Calta [8] and McMullen [23]. To 
find the modular embeddings for them is one of the motivations for this paper. 

For a reader whose main focus is modular forms this paper wants to advertise 
an interesting new class of modular forms. For example, we explain an integrality 
phenomenon for the coefficients of a solution of a Picard-Fuchs differential equation, 
like Beukers’s modular proof of the corresponding phenomenon for Apery’s famous 
differential equations, except that this time the explanation requires using a pair 
of “^-coordinates.” For a reader with main focus on Teichmiiller curves, we show 
how to rediscover many of their properties without referring to the theory of flat 
surfaces. 

The paper has three parts, linked by the aim to describe modular embeddings. In 
Part I we set up a general theory of modular forms for Fuchsian groups admitting a 
modular embedding. We call these twisted modular forms and we prove the twisted 
analogs of the properties that appear in most textbooks about modular forms in 
the untwisted case. In Part II we start from an example of a Fuchsian group with 
modular embedding where the Picard-Fuchs differential equations can be explicitly 
computed. We invite the reader to discover the properties of Fourier coefficients of 
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the modular embeddings and of twisted modular forms via this worked example. 
In Part III we give explicitly the Fonrier expansions of the modular embedding for 
the genus two Teichmiiller curves found by Calta [8] and McMullen [23]. In the rest 
of the introduction we describe the results in more detail, highlighting the main 
theorems (not necessarily in the same order) with bullet points. 


Modular embeddings and twisted modular forms. Suppose that the Fuch- 
sian group F C SL(2,R) has coefficients in a quadratic number field K with Galois 
group generated by a. The essential ingredient of a modular embedding for F is a 
holomorphic function (/? : H —>■ IH with the F-equivariance property 

/az + b\ _ a‘^(p{z) + b'^ 

^\cz + dJ (p{z) + 

We show that such a modular embedding always has a “Fourier expansion,” and an 
old theorem of Caratheodory gives us a good estimate for its Fourier coefficients. 
Analyzing ip leads us to the definition of a new kind of modular forms that we 
call ip-twisted modular form of bi-weight (k,i). This is a holomorphic function 
/ : H —>■ C with the transformation property 

For example, direct calculation shows that ip'{z) is a twisted modular form of bi¬ 
weight (2, —2). 

We develop a theory of twisted modular forms from scratch, analyzing to which 
extent classical topics of modular forms generalize to this new notion. Our first 
topic is the coefficient growth. 

• For ^ > 0 and k -\- £ > 2 the Fourier coefficients of a twisted modular form 

fi^) = (^>^) satisfy the estimate a„ = 

Similar estimates are given for other bi-weights as well; see Theorem 2.1 for the 
complete statement. The proofs combine the well-known Hecke argument in the 
untwisted case and the mechanism underlying the eqnidistribution of long horo- 
cycles. 

The next classical topic is the dimension of the space of modular forms. A 
modular embedding comes with one basic invariant A 2 , that one can view is several 
ways: as an integral over a conformal density (10), as a ratio of the degrees of the 
natural line bundles whose sections are twisted modular forms, or as a Lyapunov 
exponent for the Teichmiiller geodesic flow in the case of Teichmiiller curves. 

• For k £ even and k X^l > 2 the dimension of the space of twisted modular 
forms of bi-weight (A:, C) is the sum of (fc -1- X^l — 1) ((/ — 1) and contributions from 
the cusps and elliptic fixed points. Here g denotes the genus of H/F. 

For a torsion-free Fuchsian group this is of course a classical Riemann-Roch calcu¬ 
lation. Hence the main point is to determine the elliptic fixed point contributions 
in the twisted case. See (17) for the definition of the characteristic quantities of 
elliptic fixed points and Theorem 3.2 for the complete statement. 

Finally, the statement that modular forms expressed in terms of a modular func¬ 
tion satisfy differential equations also carries over to the twisted case. 
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• If f{z) is a twisted modular form of bi-weight {k,£) > (0,0) and t a modu¬ 
lar function, then the function y{t) defined locally by y{t{z)) = f{z) satisfies 
a linear differential equation of order {k + 1){£ + 1) with algebraic coefficients 
(Theorem 4.1). 

Modular embeddings via differential equations. The starting point of 
the whole project was a worked example, the Teichmiiller curves for D = 17, that 
we present in Section 6-8. (The definition of Teichmiiller curves along with a 
summary of the classification results for Teichmiiller curves in genus two is given 
in §5.3 and §5.4.) Starting from the flat geometry definition we briefly explain the 
derivation of the equation of the Teichmiiller curve as family of hyperelliptic curves 
and computation of the Picard-Fuchs differential equations, following [5]. 

In this way, we present in §7.1 the Fourier expansion of twisted modular forms 
explicitly. The corresponding group F is in thise case neither arithmetic nor com¬ 
mensurable to a triangle group, and the differential equations expressing the mod¬ 
ular forms in terms of a hauptmodule for F (= a suitably scaled modular function 
t : H/F —>■ of degree 1) is not hypergeometric. 

The twisted modular forms have two curious properties. 

• The twisted modular forms do not have a power series expansion in 77[[(?]] for the 

standard modular parameter q = where a. is the width of the cusp, but 

lie instead in TbjjAlg]], where A is a transcendental number of Gelfond-Schneider 
type (i.e., a number of the form with all Pi algebraic). 

• If / is a twisted modular form and t a suitably scaled modular function, then 
the function y{t) locally defined by y{t{z)) = f{z), with t{z) as above, has Ok- 
integral Taylor coefficients. 

The second of these observations was already proved in [5] using p-adic differen¬ 
tial equations. In Section 8 we will give a “modular” proof of both statements. The 
surprising feature here is that, while the classical proof by Beukers of the integrality 
of the Apery coefficients using modularity relies on the integrality of the Fourier 
coefficients of the ^-expansions of modular forms on arithmetic groups, here the 
expansions of both f(z) and t{z) with respect to Aq have coefficients with infinitely 
many prime factors in their denominators, and yet the integrality of / with respect 
to t still holds. To give a modular argument for this integrality, we have to use the 
relationship between twisted modular forms on the Teichmiiller curve and Hilbert 
modular forms on the ambient surface. 

For D — 17 the full ring of symmetric Hilbert modular forms has been determined 
by Hermann ([16]). We recall and use his construction to write down explicitly the 
equation of the Teichmiiller curves for D = 17 on the (rational) symmetric Hilbert 
modular surface in Theorem (8.3). 

• There exist coordinates U, V on the Hilbert modular surface Xn^ explicitly given 
in terms of theta functions, such that the two Teichmiiller curves on X^j are cut 
out by the quadratic equations (70) and (71). 

Modular embeddings via derivatives of theta functions. The concrete 
example D = 17 led us to the discovery of a general construction of the modular 
form cutting out Teichmiiller curves. 
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• The vanishing locus of the Hilbert modular form D9 of weight (3,9), given as a 
product of derivatives of odd theta functions, is precisely the union of one or two 
Teichmiiller curves on the Hilbert modular surface (Theorem 9.1). 

Starting from the theta function viewpoint we prove the characterizing properties 
of genus two Teichmiiller curves without relying either on the geodesic definition 
or on any flat surface properties. Maybe these ideas can be used to construct new 
Teichmiiller curves. Given the length of the paper, we simplify our task and prove 
the following results only for fundamental discriminants D. With appropriate care, 
the proofs can certainly be adapted to the general case. 

• The vanishing locus of T)9 is transversal to one of the two natural foliations of 
the Hilbert modular surface Xd (Theorem 12.1). 

• The vanishing locus of D9 is disjoint from the reducible locus (Theorem 12.2). 

On the compactified Hilbert modular surface, the reducible locus and the vanish¬ 
ing locus of a Hilbert modular form always intersect and the number of intersection 
points is proportional to the volume of the Hilbert modular surface. So the claim 
is that all these intersection points lie on the boundary of the Hilbert modular sur¬ 
face, hence at cusps of the vanishing locus of D9. While for the second statement 
we also have a proof using theta functions, we give proofs of both these statements 
relying on the following description of the cusps. 

• The cusps of the vanishing locus of D9 are in bijection with pairs consisting 
of a standard quadratic form ax^ + bxy + cy^ of discriminant D and a class 
r G Z/gcd(a,c)Z (Theorem 10.1). 

Here an indefinite quadratic form ax^ -I- bxy + cy^ is called standard if a > 0 > c 
and a-|-6-|-c<0. As a statement about cusps of Teichmiiller curves, this result 
already appears in [25] and [2]. Our proof, however, is completely different. We now 
explain the main idea. Suppose that a power series / = X) tis-® to vanish on 

a branch of a curve parameterized by <71 = and <72 = After these 

parametrizations are plugged into /, the lowest order exponent (in q) has to appear 
twice, since otherwise the terms cannot cancel. In the concrete case of / = T)9 we 
are led to the following notion. Given a fractional 0 d ideal a, we say that a primitive 
element a G (a^)^ is a multiminimizer for a if the quadratic form x 1 —>■ tr(Q;a;^) on 
a takes its minimum on one of the three non-trivial cosets ^a/a at least (and then, 
as we show, exactly) twice (with x and —x not distinguished). We show on the 
one hand that multiminimizers for a are in bijection with standard quadratic forms 
in the wide ideal class. (See e.g. [45] for the correspondence between ideal classes 
and quadratic forms.) On the other hand, on any branch of the vanishing locus the 
local parameter can be chosen such that a = ai = a{a 2 ) is a multiminimizer and 
that the multiminimizers (up to multiplication by the square of a unit) determine 
the branch uniquely up to an element of Z/gcd(a,c)Z. 

We have given the definition of multiminimizers and the description of branches 
of the vanishing locus in detail since this notion and construction reappears twice 
in the rest of the paper. First, multiminimizers appear prominently in the discus¬ 
sion of Bainbridge’s compactification below and, secondly, this description of the 
branches immediately gives the Fourier expansion of the modular embedding of 
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the uniformizing group of the genus two Teichmiiller curves for any D (see Theo¬ 
rem 13.1). Moreover, both “curious properties” mentioned in the case D = 17 hold 
in general. In particular, we have: 

• For any D, any cusp of the vanishing locus of 2)0 with corresponding Fuchsian 
group F and modular embedding cp, the i^-twisted modular forms of bi-weight 
{k,£) have a basis with Fourier expansions of the form X]n>o 

algebraic and A transcendental of Gelfond-Schneider type (Theorem 13.2). 

As another application of the description of Teichmiiller curves via theta deriva¬ 
tives, we give in Theorem 13.3 a description of the quadratic differentials on the 
leaves of the natural foliation of a Hilbert modular surface whose integral measures 
the flat distance between the two zeros of the eigenform for real multiplication. 
These quadratic differentials can be packaged together to a meromorphic modular 
form of weight (—2,4) that we give as the quotient of theta series and their deriva¬ 
tives. Our result has been used by McMullen [29] to describe the beautiful and 
complicated flat structure on the leaves visually (“snow falling on cedars”). 

Hirzebruch’s compactification and Bainbridge’s compactification. Hir- 
zebruch constructed a minimal smooth compactification of Hilbert modular sur¬ 
faces. His compactification is remarkable in many ways. First, it is the prototype 
of what is nowadays a called a toroidal compactification, i.e., it is given by a fan 
of decreasing slopes, periodic under the action of the squares of units. The fan is 
given for each cusp a of the Hilbert modular surface by the corners of the lower 
convex hull of (a^)^ in K.^. 

Bainbridge ([2]) observed that the closure in the Deligne-Mumford compactifi¬ 
cation of the Torelli preimage of Hilbert modular surfaces provides another com¬ 
pactification. This compactification is in general neither smooth nor minimal, but 
it was useful in his calculation of Euler characteristics of Teichmiiller curves. It is 
amusing to compare the two types of compactifications and to discover that they 
are parallel in spirit, but different in every concrete detail. 

• Bainbridge’s compactification is a toroidal compactification, given for each cusp 
a by the fan of multiminimizers (lying in (a^)^) for a. (Theorem 11.5). 

The second remarkable property of Hirzebruch’s compactification is that it can 
easily be computed using a continued fraction algorithm. 

• Hirzebruch’s compactification is driven by the “fast minus” continued fraction al¬ 
gorithm, while Bainbridge’s compactification is driven by a “slow plus” continued 
fraction algorithm. 

The reader will find the precise description of the algorithms in §11.2 and §10.4 re¬ 
spectively. The bijection between standard and reduced indefinite quadratic forms 
induces a subtle relationship between the number of boundary components of Hirze¬ 
bruch’s and Bainbridge’s compactification. In particular, the number of curves in 
the Bainbridge compactification of any cusp is always the same as the number for 
the Hirzebruch compactification of some cusp, but not necessarily the same one! 
The definitions and details, and several examples, are given in Section 11. 

Acknowledgements. The first named author is partially supported by the 
ERG starting grant 257137 “Elat surfaces.” He would also like to thank the Max 
Planck Institute for Mathematics in Bonn, where much of this work was done. 
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Part I: Modular embeddings and twisted modular forms 

The notion modular embedding in the sense used here appears for the first time 
in a paper by Cohen and Wolfart ([10]). They study holomorphic maps H —>■ H 
equivariant with respect to a Fuchsian group and its Galois conjugate. 

In particular Cohen and Wolfart show that all triangle groups admit modular 
embeddings. Subsequent work of Schmutz-Schaller and Wolfart ([38]) gave some 
necessary conditions for a group to admit a modular embeddding. Some Fuchsian 
quadrangle groups were shown in [36] not to admit modular embeddings, but it took 
more than a decade until new examples of modular embeddings were discovered. 

The first new examples arose from the Teichmiiller curves discovered by Calta 
and McMullen (see [ 8 ] and [23], and [30] for the modular viewpoint). All Teich- 
miiller curves give rise to modular embeddings. We summarize the known results of 
Teichmiiller curves (and thus the known groups admitting a modular embedding) 
briefly at the end of Section 1. 

Here, in Part I, we think of the group Fuchsian group F as given (e.g. in terms 
of a presentation) and study properties of the modular embeddings as holomorphic 
maps. We define an extension of the notion of modular forms to this context 
that we call twisted modular forms. The aim of the first part is to study this 
new object and to derive the analogues of the standard results on modular forms 
(Fourier coefficients, dimension, differential equations) from scratch in the context 
of twisted modular forms. 


1. Hilbert modular embeddings 


The term modular embedding is used in the literature both for equivariant maps 
from H —>■ (starting with [10]) and from —>■ Hg (already in [15]) . To 

distinguish, we call them “Hilbert modular embeddings” and “Siegel modular em¬ 
beddings,” respectively, according to the range of the corresponding map. We will 
be interested mostly in the quadratic case g = 2 and refer to [38] for basic notions 
is the general case. 

Throughout this paper we denote by AT a real quadratic field, with a fixed embed¬ 
ding K cM.. We use the letter cr to denote the Galois conjugation of K or the second 
embeddding of K into R, writing a{x) or x'^ interchangeably for x G K. By a Hilbert 
modular group for K we will mean any subgroup F^^ of SL(2,A') commensurable 
with SL(2, 0) for some order 0 C K. (Later we will make specific choices.) Such 

a group acts discretely and cofinitely on by {zi,Z 2 ) ^ a 2:2 + 

Here H denotes the upper half-plane. 


^czi + d ’ c'^Z2 + d° 


We will be interested only in Hilbert modular embeddings for which the first 
coordinate in is a local coordinate everywhere. A modular embedding of this 
type is described by the data (F, (/?), where 


• F is a subgroup of some Hilbert modular group F^f C SL(2, K) which, viewed as 
a subgroup of SL(2,]R), is Fuchsian (i.e., discrete and cofinite). 

• : H —>■ H is a holomorphic map satisfying (p o j o (p for all 7 S F. 


For such a pair (F, p), the map 2 : 1 — {z, p{z)) defines a map from the curve H/F to 
the Hilbert modular surface /Tk ■ 
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Written out explicitly, the condition on (p means that we have 

/az + b\ _ a'^ip{z) + b'^ 

^\cz + d) c‘^(p{z) + d‘^ 

for all (“ ^) € r and z gM. We will use this transformation property in §2 to define 
a 1-cocycle on F and hence a new type of modular form (“twisted” form) on F. Just 
as for usual modular forms, these must satisfy suitable growth conditions at the 
cusps of F, and to formulate these we need to know how ip behaves near the cusps. 
Assume first that one of these cusps is at oo, with the stabilizer of oo in the image 
f of F in PSL(2, R) being generated by ±( (, “) with a G K n R.+ . Then we have: 

Proposition 1.1. Suppose that the stabilizer of oo inV is generated by z z + a 
with a G K, a > 0. Then a’’ is also positive and p{z) has an expansion of the form 

g. oo 

p{z) = —z + VB„e(—) (V^Gll), (2) 

a ^ a 

n—O 

where e{x) := and where the coefficients Bn satisfy the inequalities 

\Bn\ < 29(i?o) foralln>l. (3) 

Proof. From (1) we have p{z -|- a) = p{z) + a'^, so the function p(z) — 
is invariant under z z + a and hence equals /(e(z/a)) for some holomorphic 
function f(g) in the punctured disc ID* = {g : 0 < l^l < 1}. Define a second 
holomorphic function F in D* by F(q) = e(/(g)/|a'*|). From V3(IH) C El we deduce 
that < 1 in ID*, where the sign is chosen so that ±a'* > 0. It follows 

that F(q) extends to a meromorphic function inD = {g: |g|<l} with at most 
a simple pole at g = 0. But then the fact that F has a single-valued logarithm 
in D* implies that its order of vanishing at 0 must be zero, so o'* must be positive 
and / extends holomorphically to the full disc and hence has a convergent Taylor 
expansion proving the first claim. For the second, we note first that 

the estimate |F’(g)| < |1/9| and the holomorphy of F’ at 0 imply by the maximum 
principle that |F’(g)| < 1 in the disk ID (this is just the Schwarz lemma, applied to 
the function gF(g)), so Bg has positive imaginary part and the function f{q)/Bg 
takes on values in the right half-plane. An elementary argument then gives the 
estimate \Bn\ < 2n'^{Bg). (Write /(g) = [Bg — Bg\{q))/{1 — A(g)) where A sends 
D to ID and 0 to 0; then Bn = 2z9(i3o) where [A'^Jn denotes the 

coefficient of g” in A(g)™, which is bounded in absolute value by 1 because A is.) 
The stronger estimate |i3„| < 2'Gs{Bg) follows from a theorem of Caratheodory [9], 
which says precisely that a holomorphic function mapping ID to the right half-plane 
and sending 0 to 1 has all its Taylor coefficients at 0 bounded by 2 in absolute 
value. □ 

Corollary 1.2. The imaginary part of p{z) satisfies the inequalities 
o®’ o'* 1 

—y < < —V + C{1 -b -) (4) 

a a y 

for all z = X + iy G M, where C is a constant independent of z. 

Proof. The first statement is just the inequality |F(g)| < 1 established in the course 
of the above proof, and because X^o 1 + 1/2/ the second follows from (3) 

□ 
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Exactly similar statements hold for all of the other cusps of E. Recall that 
by definition, such a cusp is an element v G whose stabilizer in T is 

infinite cyclic, say fi, = (± 7 ^). Choose g £ SL(2,K) with g(oo) = v. Then 
= ±(g “) for some positive element a of K. Equation (1) implies that the 
function (pg = o (j) o g satisfies Pg{z + a) = 'Pg{z) + , because {qI) = 

‘Pg9~^li^9 = = {9~hi^9T‘fg = {h°i)°‘Pg- 

Then the same proof as for the case ^ = Id shows that is positive and that 

fj 00 

•fgiz) =—z + ( 5 ) 


for some constants i3„ G C, satisfying the same estimate (3) as before, and for all 
z G H. Of course a and depend on v, and also on the choice of g, but the expan¬ 
sions for E-equivalent cusps are the same up to trivial rescalings, because p-yg = pg 
for 7 G E, so that there are only finitely many essentially distinct expansions. 

Another basic property of p, obtained by applying the Schwarz lemma to the 
map lip(z') 0730 /^^ : D —>■ ID, where la for a G H denotes the standard isomorphism 
(H, a) —>■ (D, 0) given hy z ^ (z — a)/(z — a), is that one has the inequalities 


p{z) - p(z') ^ z-z' \p{z) - V?(z 0 p ^ jz-z'l^ 

p(z) - p(z') ~ z-7 ’ Q(p(z))Q(p(z')) ~ Q(z)Q(z') 

or equivalently d(p(z), p(z')) < d(z,z') for any z, z' G H, where d : H x H —>• 
]R>o denotes the Poincare metric. (This is of course a standard property of any 
holomorphic map from the upper half-plane to itself.) Fixing z and letting 5(z') 
tend to infinity, or fixing z' and letting 3(z) tend to 0 , we obtain second proofs of 
the two inequalities in (4), while letting z' tend to z we obtain the estimate 


IX(.)I < 

S(z) 


(z G H). 


or equivalently |k(z)| < 1, where k : HI —>■ C is the map defined by 

k(^) = ■ 

Q(p(z)) 

From the equivariance property (1) we obtain the formulas 


t(p(z)) 


,az + b. 
'cz + d' 


c-/ / ,yaz + by (cz + df 

and these together with the standard formula 3 ( 7 z) = 9(z)/|cz -I- dp imply that 
the function k is E-invariant. We can therefore introduce a basic invariant A 2 = 
A 2 (E, p) of the pair (E, p) by 


( 9 ) 

dp imply that 


|7>'(z)p 


■ dxdy = 




vol(E\H) JJr\m //h dg 

where dp = y~^dxdy (with z = x + iy a.s usual) is the standard SL(2, ]R)-invariant 
measure on H, and where the integral can be taken over any fundamental domain for 
the action of E on HI and is absolutely convergent because |k| < 1. The invariant A 2 , 
whose values are always rational numbers, can be interpreted either as a ratio of 
the intersection numbers on the Hilbert modular surface with the line bundles L 


and L defined in §3, or as the second Lyapunov exponent ([33]), which explains the 
notation. It is a commensurability invariant of the Fuchsian group E. 
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We close this section by describing briefly the four known types of Hilbert mod¬ 
ular embeddings. The first two will play a role in this paper and will be discussed 
in more detail in Section 5. The other two are mentioned only for the sake of 
completeness. 

Type 1: Modular curves. Here r = r^ = { 7 Gri^| Hy = j'^A} where A 
is a suitable element of GL~^(2, K), and the map (p is defined by (p(z) = Az. 
The corresponding curves (1, 7 j)(IHI/r) C M?/T^ in this case are the irreducible 
components of the curves Tjv studied in [18] and [19] and reviewed in §5.2. In 
particular, there are infinitely many curves of this type on each Hilbert modular 
surface, and conversely each of these curves lies on infinitely many Hilbert modular 
surfaces in IHl 2 /Sp( 4 , Z). 

Type 2: Teichmuller curves. These are defined abstractly as the algebraic curves 
in the moduli space Mg of curves of genus g that are totally geodesic submanifolds 
for the Teichmuller metric. In genus 2, they always lie on Hilbert modular surfaces 
and (apart from one exception for the field Q('\/5)) have a modular interpretation 
as the components of the moduli space of genus 2 curves whose Jacobian has real 
multiplication by an order in a real quadratic field and such that the unique (up 
to a scalar) holomorphic form on the curve that is equivariant with respect to this 
action^ has a double zero. There are at most two curves of this type on each Hilbert 
modular surface and conversely each Teichmuller curve lies on exactly one X^. 
The proof that these curves have a modular embedding comes from [30]. (See also 
the proof of Proposition 5.6.) 

There exists a variant of these curves, not used in this paper but studied in 
detail by Weifi in [40], called “twisted Teichmuller curves,” obtained as the images 
of Teichmuller curves under the action of elements of GL{2, K)~^. They are still 
geodesic for the Kobayashi metric in , but no longer for the Kobayashi (= Teich¬ 
muller) metric in M 2 . There are in general infinitely many of these curves on any 
Hilbert modular surface. 

Types 3 and 4’ Curves related to Prym varieties. Recall that a Prym variety 
is the kernel of the map Jac(C') —>■ Jac(C'o) induced by a double cover C ^ Co 
of curves. By the Riemann-Hurwitz formula, it is 2-dimensional if and only if the 
genus g oiC lies between 2 and 5. For the cases 5 = 3 or 4, there is a construction of 
Teichmuller curves in the moduli space Mg corresponding to certain Prym varieties 
having real multiplication by an order in a real quadratic field [26]. Our cases 3. 
and 4. are these two cases, in the order g = 4, g = 3. 

The four types 1.-4. are distinguished by the invariant A 2 , which takes on the 
values 1 for Type 1, ^ for Type 2, -1 for Type 3, and ^ for Type 4 ([2], Theorem 15.1 
and [33], Proposition 5.1). For each of these values, there is an infinite number of 
commensurability classes of Fuchsian groups with this invariant. 

The exceptional Teichmiiller curve over Q(-\/5) corresponds to A 2 = 1/2, and at 
the time of writing this is the only known commensurability class of a Fuchsian 
group with this value of the invariant A 2 . 


= (itp for all fi in the order; cf. §5.1 (“first eigendifferential”) for details 
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2 . Twisted modular forms 


For any function (/? : H —>■ H satisfying (1) we can define two factors J{g, z) and 
J{g,z) for g G SL(2,iir) and z G IH by 

J{g,z) = cz + d, J{g,z) = c'^(p{z)+d'^ if 5 =(“&). (H) 

The calculation 

J{lil2,z) = {cia2 + diC2Y ip{z) + (0162 + ^ 1 ^ 2 )'^ 


a'^ip{z) + 6 ^ 




+ di) (cWY) +^2) 


(ci‘p( 


a2Z ■ 


' C2Z + d2 

= J(7i,72z)J(72,z) 


) + (c 2 '^(^) + d^) 

for 7i, 72 G r 


(by eq. (1)) 


shows that J is a cocycle for F. (The corresponding statement for J, which follows 
from the same calculation with (p = Id, is, of course, standard.) It follows that the 
map / I—>■ fl(k,£)ff of the space of holomorphic functions in El to itself defined for 
k, £ gZ and g G GL''"(2,iG) by 

if\ik,i)9){z) = J{g,z)-'^ Jig,z)-^ f{gz) (12) 

is a group action when restricted to F. 

We now define a ip-twisted modular form of bi-weight (fc, £) on F to be a holo¬ 
morphic function / : El —>• C satisfying f\(k 7)7 = / for all 7 G F together with 
the growth requirement that the function fg is bounded as 9(z) —>■ 00 for every 
g G SL(2,iG), where fg{z) = (cz -h d)~^(c'^(pg(z') -h d'^)~^f{gz). The function fg 
depends only on the coset Fg. 

To describe its Fourier expansion, we need to distinguish cases. If — I G F, then 
the modular transformation property implies that k -\- £ is even if Mk^t,{J£) is to 
be non-zero. If —1 ^ F, then there may exist non-zero twisted modular forms for 
both parities of A: -I- Let a as before be the totally positive element of K with 
= (±(() “)). Recall that the cusp goo is called irregular if F does not 
contain —1 and if (g^^Fg) = ( —(J “))■ Now there are two cases. If the cusp is 

regular or if fc -|- ? is even, then fg{z -\- a) = fg{z) and the Fourier expansion of fg 
has the form 

fg{z) =On e{nz/a) as 5(z) —>■ 00 , (13) 

n>0 

where n ranges over non-negative integers. Only if the cusp is irregular and k -\- £ 
is odd, then fg{z -\- a) = —fg{z) and the Fourier expansion is as in (13), but now 
with n ranging over Z>o -|- \ instead of Z>o. 

If the Fourier coefficient oq is 0 for all cusps (a condition that is automatically 
satisfied at regular cusps ii k 7- £ is odd), we call / a cusp form. The spaces 
of (/ 5 -twisted modular forms and cusp forms of bi-weight {k,£) will be denoted 
by Mk.e{T,(p) and Sk,e{r,ip), respectively. We will often omit the ‘V” when no 
confusion can result. 

Obviously, ordinary modular forms of weight fc on F are (^-twisted modular 
forms of bi-weight (fc, 0) for any (p, and in fact Mkp{T, (p) = Mk{T). We give three 
examples of twisted modular forms with £ ^ 0 . 
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i) We always have ip' G M 2 ,_ 2 (r, (/?), by virtue of the second equation in (9) and 
the expansions of ip at the cusps given in Section 1. This example shows that the 
weights k and ^ of a holomorphic twisted modular form do not both have to be 
positive. A similar example, which follows from the calculations given at the end 
of Section 4, is that 2(p'ip"' — Zip"’^ belongs to Ms,- 4 {T,(p). Note the quotient of 
this by ip'"^ is the Schwarzian derivative of tp. 


ii) In the case of modular curves, when the map : H —>■ H is given by a 
fractional linear transformation A G GL'^{2, K), the calculation 


J( 7 , z) = Az + d'^ = J( 7 '^, Az) 


j{rA,z) 

J{A, z) 


J{A'y,z) 

J{A,z) 


J{A,jz) 

J{A,z) 


J{ia) 


for 7 G r shows that if / belongs to i^), then the function fA{z) = 

J{A,z)~^f{z) belongs to Mk+e{r) in the usual sense, so that here we do not get 
a new kind of modular forms. For Teichmiiller curves, on the other hand, the 
automorphy factor J cannot be reduced to an automorphy factor of the classical 
sort, and the twisted forms are a genuinely new type of modular form. 


iii) If F/f is a Hilbert modular group containing F and A is a Hilbert modular 
form of weight {k,£) on Tk (i.e., F : —>■ C is a holomorphic map satisfying 

F( 72 : 1 ,7'^ 2 : 2 ) = (c 2 i +d)^{d^Z 2 + d'^)^F(2i, 22 ) for all 7 G F^ and all 21 , 22 G H), 
then the restriction of F to H under the embedding (I, (/?) : HI —>■ is an element 

of Mk,t{T,ip). 

The last example provides many twisted modular forms for any (F, 1 ^). But not 
all twisted modular forms arise in this way, and it makes sense to study the twisted 
forms independently of the two-variable theory. In particular, one can ask for the 
dimensions of the spaces Mk^t{T,ip) and Sk^iT^ip) and for the structure of the 
bigraded ring ip) = ^ (/?), just as is usually done for classical mod¬ 

ular forms when £ = 0, and we can also study the classical topic of growth of Fourier 
coefficients. We shall give a general formula for the dimensions in the next section 
and a description of the ring of twisted forms in a special example in Section 7, 
while the rest of this section is devoted to the study of the coefficient growth. 


Theorem 2.1. Let f{z) = ^anq" be a twisted modular form of bi-weight (k,£), 
and set K = k Then the Fourier coefficients of f satisfy the estimates 

{ q^^k/ 2) if k + £ <2 or f is a cusp form, 

0(n^/^logn) ifk + £ = 2, 

O(^fc-l-rmax(0.^)) ifk + £>2. 

Proof. Suppose first that / is cuspidal. Here we use a modification of the well- 
known argument given by Hecke in the untwisted case. We construct the real-valued 
continuous function 

F (. z ) = |/( 2)|//2 

where y = y{z) = 9 ( 2 ) and y = y{z) = S5((/9(2)). This function is F-invariant by the 
definition of a twisted modular form. Since / is a cusp form, F decays rapidly at 
cusps and hence is bounded on the whole upper half plane, so f{z) = 

On the other hand, ^ Jq f{x + iy)e{—n{x-\-iy)/a) dx for any y. Specializing 
to y = 1/n and using the first or the second inequality in (4) depending on the sign 
of £, we obtain the estimate stated. 
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In the remaining cases, still 


< 


a 


pOi poc 

/ F{x + i/n)n^^‘^ -\- l/n))~^^'^dx <C / F{x + i/n) dx, 

Jo Jo 


by (4) (where the constant implied by <C depends only on F), but now F(z) = 
F{'-f{z)) = 0{F[{z)J^'^^'^/'^) instead of 0(1), where H{z) is defined in the lemma 
below. Since the exponent k — 1 + max(0,^) is equal to iF/2 + {k + £)/2 — 1, the 
remaining statement is precisely the content of the following lemma. □ 


Lemma 2.2. Let T be a non-cocompact Fuchsian group, with the width of the cusp 
at oo equal to 1, and define the height function H{z) = max.ygr ^( 72 ). Then for 
n > 1 and A > 0 one has the estimates 

1 [0(1) z/0<A<l, 

/ H{x + fi)^dx= <0(logn) i/A = l, 

[O(n^-i) */A>l, 

where the implied constant does not depend on n. 


Proof. The case A = i is [37], Proposition 2.2. Essentially the same method can be 
used to give all cases. We provide the details only for A > We let T = (), [) and 
choose for each of the h cusps pj of F a matrix Nj G SL(2, R) such that Njrjj = 00 , 
and such that the stabilizer of 00 in Fj = NjVNj^ is always (T). Let T be a closed 
fundamental domain for F, which we may choose so that the cusp neighborhoods 
have the shape 


iV,(T)n{CGH:S(z)>B} = [0,l]x [B,oo) (j = l,...,/i) 

for some B > 1, and are disjoint. We define the truncation function \x\b to be 
a; if a; > i? and 0 otherwise. Since the complement in the fundamental domain of 
the cusp neighborhoods is compact, it suffices to prove the statement of the lemma 
with \H\ B in the place of FI. Note that 

/ lHix+^)\Bdx = [ LS(7(a; + 7 ))^ da; • (14) 

a=i7e<T)\r,-do 


Suppose that 7 = (“ [)) gives a non-zero contribution to the right hand side. Recall 
that Shimizu’s Lemma states that in a Fuchsian group, normalized so that the 
cusp 00 has width 1, either c = 0 or 1 < jcj. Here the truncation implies that c ^ 0 
and that 

i<-—, 

■ IWi' + i) + <'F 

from which 

1 < |c| < -/n and — - G [— 1 , 2 ]. 

c 

On the other hand, with the substitution x = -k we get 


+ i))\B dx < j 


1 /r 


{cx -I- dfi -\- (? jnf 


dx 


/■“ dt 

^ 7-00 + 1 )^ 
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(It is this estimate which has to be changed, taking into account [ • Jb, when 
A < 5 .) We define 

ClAx,X) = { 7 =(-^)G(T)\r, : x<|c|<X, 

and The crucial observation now is that the cardinality of 

this set is bounded for Af > 1 , any j and ^ by 

^Cl,{X)<{v-^x)X^ + l. 

This again follows from Shimizu’s Lemma, which implies that for two matrices 
7 = (“ ^) and y = (“' ^;) in Cl,{X) we have \^ - i\> \cc'\-^ > X-^ (see [ 21 ], 
Proposition 2.8, for details). 

If A = 1, then the contribution of the j-th cusp to the right hand side of (14) is 
bounded above by a constant times 


E 

iGCii 2(V") 



< 


i logj n 

E E 

fe=0 T-eCii 2(2'“"b2'‘) 



< 


5 log2 n 

E 


3-22fc 

22 fc -2 


O(logn). 


The other cases with A > 1/2 are calculated the same way, the estimate for the 
left-hand side of (14) now being iog 2 n which is 0(n'^“^) 

for A > 1 and 0(1) for A < 1. □ 


The bounds in the preceding theorem are not sharp for the classical case £ = 0, 
since the methods of Rankin and Selberg give an improvement for any Fuchsian 
group. Note, however, that the deeper results of Deligne cannot be applied here, 
since on Teichmiiller curves one does not dispose of Hecke operators. 

In the more interesting case of strictly twisted modular forms (i.e. £ ^ 0) we 
know in the example of ^p' that the coefficients grow like 0{n) by Caratheodory’s 
estimate (3), while the preceding theorem with k = 2, £ = —2 gives only the bound 
Oiri?). However, we do not know if the behaviour of the coefficients of ip' is typical 
for a twisted modular form of bi-weight ( 2 , — 2 ) or if there exist other elements 
whose coefficient growth is closer to O(n^). 


3. Dimensions and degrees 

Twisted modular forms of bi-weight (fc, £) can be thought of as sections of the 
bundle with appropriate growth conditions at the cusps, where L and L 

are the line bundles over H/T defined as the quotients of H x C by the equivalence 
relations {z,u) ~ {jz, J{'y, z)u) and (z,u) ~ z, J{'y, z)u) for 7 G F. The dimen¬ 
sion of the space of such forms for a given group can therefore be computed by the 
Riemann-Roch theorem for curves, just as in the case of classical modular forms, if 
we know the degrees of the two bundles L and L and the numbers of cusps and ellip¬ 
tic fixed points of various orders of the group. In particular, if F had no cusps and no 
fixed points (a situation of which, so far as we know, there is no example), then the 
Riemann-Roch theorem would give dimMfc 7 (r) = {kdeg{L) +£ deg(£)—1) {g — 1), 



THETA DERIVATIVES 


15 


where g is the genus of H/F. The presence of cusps (including possibly irregular 
ones) and elliptic fixed points will make the actual formula more complicated. 

Let n be a torsion-free subgroup of finite index in F. Such a group always exists, 
since the level three subgroup of a Hilbert modular group is torsion-free and F is 
a subgroup of the Hilbert modular group. We define Hq to be a subgroup of finite 
index such that the eigenvalues of all parabolic elements are one (i.e. all cusps are 
regular). Such a subgroup exists, since H is free if it is has a cusp. By passing to 
a smaller subgroup if necessary, we may suppose Hq C F to be normal. We let Lq 
and Lq be the line bundles over H/Ho defined by the automorphy factors J and J 
respectively. 

The basic invariant we attach to a Hilbert modular embedding is the ratio 

A 2 = deg(Zo)/ deg(£o). (15) 

As a consequence of the proof of Theorem 3.2 below, we see that this number does 
not depend on the choice of Hq among torsion-free subgroups of F with regular 
cusps. The value of the invariant A 2 is given by 1, 1/3, 1/5, 1/7 on the four classes 
of Hilbert modular embeddings described in Section 1. 

The definition of A 2 and the classical result that ordinary modular forms of 
weight 2 are differential forms on H/Ho imply the following Proposition. 

Proposition 3.1. Let go and sg denote the genus and the number of cusps of 
H/Ho, respectively. Then 

deg(£o) = 5 o - 1 + so /2 and deg(Zo) = A 2 (go - 1 + so/ 2 ). (16) 

Now we have to pass from Hq to F, which may have both elliptic fixed points 
and irregular cusps. If F contains —then we will assume that k + £ is even, since 
otherwise the equation /|(fe/)(—/) = —/ implies that the space of twisted modular 
forms of bi-weight {k,£) is 0 . 

We define characteristic numbers at elliptic fixed points and cusps for the bundle 
of twisted modular forms of bi-weight {k, £) in the following way. Suppose that x 
is an elliptic fixed point and that the isotropy group F^, is of order Ux in SL(2,IR). 
We take a generator 7 =(*J)GFa; that acts on the tangent space at x by a 
rotation by ‘I'njux in the positive direction, i.e. such that arccos(tr( 7 )/ 2 ) = ‘I'njnx 
and c svaffl'Kjux) < 0. We let Since 7 is of finite order, so is 7 '^ and we 

can define rZ £ — Z by 

cos(27rr^) = tr( 7‘^)/2 and c'^ sin(27rr^) < 0. 

Then the characteristic number at x is defined as 

bx(k,£) = {-krx - £rx} , (17) 

where the curly braces denote the fractional part (in [ 0 , 1 )) of the rational number. 
If X is a cusp, we define the characteristic number bx{k,£) to be 1/2 if the cusp is 
irregular, —I ^ F, and k + £ is odd, and we let bx{k, £) = 0 in all other cases. 

We remark that characteristic numbers are a finer information than the usual type 
of the elliptic fixed point ([39], Section 1.5 and V.7), since there are two possibilities 
even for fixed points of order two (in PSL(2,]R)). We let A = and S = 

G SL(2,0£)). Then the contribution of AS'A”^ G SL(0/)©0 d) is 6a;(A:, £) = 
{ ~ 4 ^^ }’ whereas the contribution of S is bx{k,£) = { ~ 4 "^ } ■ Note that in all 
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these calculation we consider modular embeddings to If we consider a modular 
embedding to H x El”, the fixed point of S is —i in the second factor, so its 
contribution is £) = { ~ 4 ^^ }■ 

We can now give the dimension of the space of modular forms in terms of the 
topology of El/r and those characteristic numbers. We let be the order of the 
isotropy group Fa, in PSL(2,R). 

Theorem 3.2. Let k and £ be integers. If k + £ is odd and T contains —I, then 
= 0. If—I ^ r or k+ £ is even, then twisted modular forms of 
bi-weight (k,£) are precisely the global sections of a line bundle Lk,( of degree 

degiLkj) = ik-£X2£)(g-l + ^ + ^ ^ (l - ^)) - 

a:GH/r ^ sGH/E 

where g and s denote the genus and the number of cusps ofT. If also k + A 2 ^ > 2, 
then 

dim Mkj(T,(p) = deg{Lk,e) + (1 - 5 ) ■ 

Proof. We mimic the standard argument for ordinary modular forms and describe 
Lk,e as a subsheaf of If t is a local parameter at x, for x both in EI/F or 

being a cusp, the stalk {Lk,e)x at consists of all germs of holomorphic functions / 
with /(yt) = J( 7 , t)* J( 7 , yj(t))^/(<) for all 7 G Fj, in the stabilizer of x. With this 
definition, twisted modular forms of bi-weight (fc, £) for the modular embedding p 
are obviously the global sections of £^ 7 . 

In order to compute the degree of £^7 we use the map tt : EI/IIo —>■ H/F. This 
induces an inclusion TT*{Lk,e) —t Since we know the degree of the image 

in terms of g, s and A 2 , it suffices to compute the degree of its cokernel OC. This 
cokernel is supported at the elliptic fixed points and at the cusps. 

Suppose first that 2 :.^ G El is the fixed point of 7 G F. The point p{z-f) is fixed 
by 7 '^, so that by the cocycle condition both and J{'j, p{z-f)) are roots of 

unity of some order that divides the order of the isotropy group Ux. 

Now let y be one of the preimages of x and let m be a local parameter at y, so 
that t = w”®. Then 

(4Z^o)y = C[M] and {tt*{L k,e))y = C[[u]], 

where Bx(k,£) G [0,1) is the unique rational number such that 
J(-i,z^fj{-i,p{z-^)Y = e{Bx{k,£)). 

Consequently, dimDCy = nLBx{k,£) for each of the deg( 7 r)/n;;i points y above x. 

Next we want to show that Bx{k,£) = bx{k,£). li x = i, then the generator of 
F^ specified above is 7 = co"( 2 v”*)) J{ji,i)^ = e{-krx) 

and J{ji,iY = e{—£rf.) by definition of rx and rf. This proves the claim in the 
special case x = i. For the general case note first that for any a G SL(2,]R) the 
cocycle property implies J{'^,z-f) = J{a'ya~^,Za-ya-Y^ where Za.^a-i = az-^ is the 
fixed point of The equivariance property (1) implies that p{z-f) is the fixed 

point of 7 '^ and hence .J('y,Zj) = J, z-f<^). If a takes the fixed point of 7 to i, 
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then Qf'^ takes the fixed point of 7*^ to i and so 

= J{aja, iY ,iY = Jiaja, iY J{{a'yaY, iY 


reduces to the case already considered. 

Now suppose that a: is a cusp and let y be one of the cusps of IIo above x. If 
denotes the degree of the covering tt at y, then there are deg( 7 r)/n^ cusps above x 
since tt is Galois. 

We start with the case —I ^ F. Then the stabilizer F^; is infinite cyclic. Let 7 
be a generator. The same argument as for ( 18 ) allows us to assume that the fixed 
point Zj = 00. Note that 7'^ also fixes 00, so that J(7, 00) = J(-y,(p{(X))) = 72,2, 
the lower right entry of 7. Since the cusp is irregular if and only if the generator 7 
has 72.2 = —1, we deduce 

J{l,ooY J{l,(p{oo)Y = e{bx{kY)) 


for bx{k,£) defined above. On the other hand, let a be the width of the cusp 00 of 
Fq, so that q = e{z/a) is a local parameter at y. Then 

- C[[g]] and (7r*(£M)y = ^[[9]], 

so that in total dimdCy = n^bx{k,£)- 

With the same local calculation one checks that if —I G F always 7 T*{£jk,i)y — 
(£§£0)^. Hence in this case, too, dirndC^ = n^bx{k,i) = 0 holds by definition. 
Altogether, this implies 

deg{Lk,i) = degV) {Y + ^2^) deg(£o)) - bx{k,£)- 

kGH/T 


The number of cusps of F is sq = deg(7r) 
Riemann-Hurwitz formula 

9(H/no) - 1 


deg( 7 r) 


= g(H/F) - 1 + - 


=. Together with ( 16 ) and the 

o-=) 


cGH/T 


this implies the degree claim. The dimension statement then follows from Riemann- 
Roch since the iF^-term vanishes for deg(£fc7) > 2 g — 2 , which is guaranteed by the 
hypotheses on fc + A2F and the fact that bx{k,£) < 1 — ^ for all cusps and elliptic 
fixed points. □ 


4. Differential equations coming from twisted modular forms 

A basic fact about modular forms, whose proof will be recalled below, is that 
for any Fuchsian group F C SL( 2 ,]R), any modular function t on F and any mod¬ 
ular form / of integral weight fc > 1 on F, the function y(t) defined locally by 
f{z) = yit{z)) satisfies a linear differential equation of order k + 1 with alge¬ 
braic coefficients (and even with polynomial coefficients if H/F has genus 0 and 
t is a hauptmodule^). In this subsection we prove the corresponding statement 
for twisted modular forms. This statement will give one of the two approaches 

^Recall that a “hauptmodule” (or “Hauptmodul” if one retains the German spelling) is a 
modular function t giving an isomorphism between H/F and P^(C) if the former has genus 0. 
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used in this paper to describe Teichmiiller curves explicitly on Hilbert modular 
surfaces, by comparing the differential equations coming from their geometric defi¬ 
nition (Picard-Fuchs differential equations) with the differential equations satisfied 
by suitable twisted modular forms on them. 

Theorem 4.1. Let f{z) be a twisted modular form on (F, ip) of bi-weight {k,i), with 
k, £ > 0, and t{z) a modular function with respect to the same group F. Then the 
function y{t) defined locally by f{z) = y{t{z)) satisfies a linear differential equation 
of order {k 1){£ 1 ) with algebraic coefficients. 


Proof. It suffices to prove this for the two cases {k,£) = (1, 0) and (0,1), since the 
general case follows from these. (The number {k -I-1)(^ -I-1) arises as the dimension 
of Sym’^(Vi)( 8 )Sym’^(V 2 ) where dim Vi = dimV 2 = 2.) The hrst case is the classical 
theorem mentioned above, of which several proofs are known (see e.g. §5.3 of [7]). 
We reproduce one of them here since it generalizes directly to the more complicated 
case of bi-weight ( 0 , 1 ). 

Let, then, f(z) be an ordinary modular form of weight 1 and t{z) a modular 
function on F. By dehnition we have the two transformation equations 




az -\-b\ 
cz dJ 


= t{z), 




az b\ 
cz dJ 


(cz -h d) f(z) 


for all matrices (“ ^) G F. Differentiating these equations gives the further trans¬ 
formation equations 



(cz -b dft'{z), 

(cz -b dff'(z) + c(cz + df f(z ), 

(czd)^ f"(z) -b 4:c(czd)^ f'(z) -b 2c^(czd)^ f (z). 


The first of these equations says that t' is a (meromorphic) modular form of weight 2, 
and by combining the others we find that the expression 2 /' — ff" is a modular 
form of weight 6 . It follows that 


t'(z) 

fiz^ 


A{t(z)), 


V'(zf - f(z)f"(z) 
t'(z)f(zY 


B{t(z)) 


(19) 


for some rational (or, if t is not a hauptmodule, algebraic) functions A(t) and B(t). 
A direct calculation shows that 


1 

T 



, y' - ff" . _ n 

+ f 4 I ) 


t'f 


It follows that the function y(t) defined parametrically by the equation y(t(z)) = 
f(z) (which of course can only hold locally, since t(z) is F-invariant and f(z) isn’t) 
satisfies the second order linear differential equations 


{A(f)y'(f))' -b B(f)y(f) = 0, (20) 


or Ay" -b A'y' -b By = 0. This proves the theorem in the case (fc, £) = (1,0). 

Now suppose that / is a twisted modular form of bi-weight (0,1), i.e., / satisfies 
the transformation equation = (c"‘p(z) -\-d'^)f(z) for (“()) G F. From 
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this equation, and from equation (1) and its derivative (= second equation in (9)), 
we find by further differentiating the transformation equations 

b\ {cz + d)'^ „ 2c{cz + d)^ 


‘P 


. /az + b\ 
\cz + d) 


{c'^(p{z) + d'^Y 
2c'^{cz + dY 


{d^Lp{z) + d'^Y 


p"{z) + 


{d^(p{z) + dY‘- 




= (cz + dY{c'^p{z) + dYf{z) + cYcz + drY{z)fiz), 
= {cz + dY{c'^(p{z) + d'^)f"{z) 

\cz a/ 

+ [2c{cz + dY{c'^ip{z) + d'^) + 2c'^ {cz + dY(p'{z)] f(z) 
+ \c'^{cz + dYif"{z) + 2cc‘^(cz + c?)^i^^(z)] f{z). 


From these equations it follows that the combination (2/'^ — f f")Y + //V" is a 
modular form of weight 6. But we have already seen that t' and Y are twisted 
modular of bi-weights (2,0) and (2,-2), respectively. It follows that 


iV'izf 


t'{z) 


p'{z)f{z? 

f{z)nz))p'{z) + f{z)nz)Y'{z) 

t'{z)ip'{zYf{zY 


A{t{z)), 

B{t{z)) 


( 21 ) 


for some algebraic (resp. rational if t is a hauptmodule) functions A{t) and B{t), 
and since by direct calculation we have 

1 / t' 1 .\' (2f' - ffW - //V" f ^ n 

in this case, we deduce that / satisfies a second order linear differential equation of 
the same form (20) as before. □ 


Remark. The two weight 6 modular forms 2/'^ — //" (for / G Mi_o(r)) and 
( 2//2 _ _|_ ff'ip'' (for / G Mo,i(r)) used above, which are easily checked 

to be holomorphic at the cusps, are special cases of the classical Rankin-Cohen 
bracket and of twisted versions of it, respectively. Without going into details, we 
mention that the twisted Rankin-Cohen brackets of two twisted modular forms 
fi G Mfe.,^.(r, (/?) [i = 1, 2) can be defined as the product of the usual Rankin- 
Cohen brackets of ip' ' fi and ip' /2 (which by example i) of Section 2 are 
ordinary modular forms of weight ki -f (.i and ^2 + ^2 on F) with a suitable power 
of ip'. 
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Part II: Modular embeddings via differential equations 

In Section 4 we have seen abstractly how classical or twisted modular forms give 
rise to differential equations. In Part II we show conversely, in a specific example, 
how to obtain from these differential equations the Hilbert modular embedding ip. 
The example that we will consider in detail is D = 17, for which the differential 
equations needed were computed in [5]. In Section 6 we will sketch how these were 
obtained, referring to that paper for the full details. In Section 7 we discuss the 
arithmetical properties of the solutions of these differential equations and compute 
the Fourier expansions of the corresponding modular forms at all cusps. We turn 
Theorem 3.2 into a concrete description of the ring of modular forms (Theorem 5.5 
for the general result and Propositions 7.1 and 7.2 for the special case D = 17), 
since the corresponding local invariants can be computed for Teichmiiller curves. In 
Section 8 we will show how to use these solutions to obtain an explicit embedding of 
the Teichmiiller curve in the Hilbert modular surface. The introductory Section 5 
provides the necessary background on Hilbert modular surfaces and Teichmiiller 
curves in genus 2. 


5. Curves on Hilbert modular surfaces 

As we have already said, there are two basic examples of the situation described 
in Section 1: modular curves and Teichmiiller curves. In this section we describe 
both of these, the first relatively briefly since it is well known and the second in 
more detail. We begin with a preliminary subsection specifying more precisely the 
Hilbert modular surfaces that will be used in this paper. The main new result in 
this section is the dimension formula Theorem 5.5. 

5.1. Hilbert modular groups and Hilbert modular surfaces. As before, we 
denote by AT be a real quadratic field, together with a fixed embedding AT C R, and 
denote by a both the Galois conjugation and the second embedding of K into M. In 
§1 we briefly defined Hilbert modular groups and Hilbert modular surfaces, denoting 
them generically by P^ and 'B?/Tk- Now we want to be more specific. Our general 
reference are Hirzebruch’s seminal paper [17] and the book [39] by van der Geer. 

Usually when one speaks of “the” Hilbert modular group for K one means the 
group SL(2, Od), where D is the discriminant of an order 0 = Od C AT. However, 
since we want principally polarized abelian surfaces, we need to work instead with 
the modified Hilbert modular group 

SL(0^©0) = o'") n SL(2,K), 

where 0^ denotes the set of x G AT for which xy has integral trace for all j/ G 0. 
One has 0^ = 0“^, where in the case of quadratic fields the ideal 0, called the 
different of AT, is simply the principal ideal {sTD) = VDO. Note that the two 
groups SL(2, 0) and SL(0^ © 0) are conjugate in GL(2, K) by the action of the 
diagonal matrix A = (J and in particular are isomorphic as abstract groups. 
But the action of A interchanges the upper and lower half-planes in the second 
factor (since the Hilbert modular group acts on the second factor via its Galois 
conjugates and the Galois conjugate of the determinant of A is negative), so the 
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quotient is the Hilbert modular surface 

Xd =HVSL(0^©0), 

which is isomorphic to Xq = H x IHI“/SL(2, 0) and not in general isomorphic to 
the “standard” Hilbert modular surface Xq = IH^/SL(2,0). (They do not even 
necessarily have the same Euler characteristic.) If 0 contains a unit e of negative 
norm, which happens, for instance, when D is prime, then D is principal in the 
narrow sense and the varieties Xq and Xq are isomorphic via (zi, Z 2 ) >—>■ (szi, £'^^ 2 ). 

To a point z = (zi, Z 2 ) G we associate the polarized abelian surface = 
C^/£z, where £z C is the lattice 

£z = {(o^i + b, (i^Z2 + I CL G 0^, b G 0)} 5 (22) 

with the action of 0 on induced from the action A(ui,V 2 ) = (Aui, A'^'y 2 ) of 0 
on and with the polarization induced from the antisymmetric pairing 

{{a,b), {a',b')) =tTx/Q{ab' - a'b) (a, a'G (0^)“\ b, b'€ 0). (23) 

This pairing is unimodular and the polarization is principal, which is why that case 
is of special interest. 

We observe that the action of 0 on Hz gives a canonical splitting of the 2- 
dimensional space of holomorphic 1-forms on A into two 1-dimensional eigenspaces, 
generated by the differential forms w = dvi and uj = dv2, which we will call the 
first and second eigendifferential, respectively. If Hz is the Jacobian of a curve C of 
genus 2, then by the canonical identification of the spaces of holomorphic 1-forms 
on C and on Hz we obtain corresponding eigendifferentials on C. These will be 
used in the definition of Teichmiiller curves in §5.3. 

Since the isomorphism class of Hz depends only on the image of z in Xn, and 
since polarized abelian surfaces are parametrized by points in the quotient of the 
Siegel upper half-space H 2 by Sp(4, Z) we get an embedding of the Hilbert modular 
surface X^ into the quotient H2/Sp(4,Z), a Siegel modular embedding. Explicitly, 
let Ip be the map from to H 2 given by 

Ip : z = (zi,Z 2 ) z^2) 

where, for some Z-basis ( 011 ,^ 2 ) of 0 we let 

B=h and A = B-K (25) 

\^2 W 2 J 

We define a homomorphism ih : SL(0^ © 0) —>■ Sp(4,Z) by 


4 ' : 


fa b\ (B^ 0\ /a b\ (A^ 0\ 

Vc dj ^ 1 0 hJ l^c dj V 0 W ’ 


(26) 


where d for a G K denotes the diagonal matrix diag(a, a'^). Then the map is 
equivariant with respect to the actions of SL(0^ © 0) on and of 'I'(SL(0'' © 0)) 
on IHI 2 , so it induces a map, also denoted by i/f, on the level of quotient spaces. 

Remark on notation. We will use the letters ip and for Siegel modular 
embeddings and (p and for Hilbert modular embeddings. The capital letter will 
denote the map on the level of modular groups and the small letter the map on the 
level of symmetric spaces or quotient spaces. 
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More generally, for any invertible 0-ideal a the trace pairing (23) on the lattice 
© a is unimodular and consequently, the abelian surface £z,a, defined as in ( 22 ) 
with 0 replaced by a, is principally polarized. This implies that the Hilbert modular 
surfaces 

Xu a = ]HI/SL(a^ © a), 

where 

SL(a^©a) = ) nSL(2,iO), (27) 

also parametrize principally polarized abelian varieties with real multiplication 
by 0. The only difference is that now the cusp at oo of Xu,a is in general a 
different one than the cusp at oo for Xu- We will use these variants Xu.a when 
we discuss cusps of Hilbert modular surface in Section 10. If we construct B us¬ 
ing some Z-basis {uji,uj 2 ) of a, then (24) defines a map that is equivariant with 
respect to a homomorphism 

: SL(a''©a) ^ Sp(4,Z) (28) 

given by the same definition (25). Hence the pair {-tp, ih) defines a Siegel modular 
embedding of Xu.a- 

5.2. Modular curves. We already defined the modular curves in Section 1 as the 
quotients of H by subgroups of SL(2,]R) of the form T^ = {7 G T^ | ^7 = 7 '^^} 
where A is “a suitable element” of GL(2,iG), embedded into appropriate Hilbert 
modular surfaces via z 1 — (z,Az). Here “suitable” means that the adjoint of A 
equals its Galois conjugate, so that ^ = (some (a, 6, A) G Q x Q x K, 
which after multiplying H by a suitable scalar in (which does not change the 
definition of F^) we can assume belongs to ZxZx Ok■ The corresponding embedded 
curve in is defined as aziZ 2 + i^'^zi + vz 2 + 6 = 0, where v = X/Vd g 0^ = 5“^, 
and the union of these curves (or rather, of their images in the Hilbert modular 
surface) when A ranges over all matrices as above with given determinant N G N 
is denoted by Tk- These curves Tjv were studied in detail (for the Hilbert modular 
surface Xq) in [18] and [19]. We recall a few results that we will use. The curve Tn 
is non-empty if and only if N is congruent modulo D to the norm of an element of 0, 
and is non-compact (i.e., passes through the cusps) if and only if N is the norm of 
an integral ideal a of K, in which case each of its components is non-compact. It 
is not in general irreducible, for three reasons. First, we have Tk = 
where Fjv is defined like Tk but with the additional requirement that (a, 6 , v) is 
primitive in the lattice Z x Z x c)~^. Secondly, the Fjv are in general not irreducible 
either, but decompose as where a ranges over the elements of c )”^/0 

with iV(a) = N/D (mod 1) and F]\[{a) is defined by requiring iz = a (mod 0) ([43], 
p. 4, Remark I). Finally, even the Fjv(a) need not be irreducible. (For instance, 
if H = p is prime and p^\N, then the two Legendre symbols (a/p) and ( 6 /p) 
cannot both vanish or have opposite values, so Fjv = Fjv(O) has two components 
distinguished by the invariant £ G {±1} defined by (a/p) = e or ( 6 /p) = e; cf. [13].) 
However, if N is “admissible” in the sense of the proposition on p. 57 of [19] (i.e., 
N is the norm of a primitive ideal in the principal genus), then that proposition 
says that each FN{a) is irreducible, as one shows by counting the number of cusps 
of each component separately and of the whole curve Fjv ■ 
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The same results apply to the curves in defined (and denoted) in the same 
way but with the determinant of A now being —N rather than N. Using the 
identification (zi, Z 2 ) '—>■ {VDzi, —^/~Dz 2 ) of Xq with we can consider these as 
curves va Xd, the defining equation now being 

ADZ 1 Z 2 + yzi + Az2 + S = 0 {A, B gZ, XgO, ABD - AA'^ = N). (29) 

A special union of these curves will be play a role a in characterization of Teich- 
miiller curves below. In the moduli space of principally polarized abelian surfaces 
A 2 we denote by P the product locus (also called reducible loeus), i.e. the locus of 
abelian varieties that split, as polarized abelian varieties, into a product of elliptic 
curves. The Torelli map gives an isomorphism 

t : 3 VC 2 —^ A 2 \ P • 

The intersection of P with the Hilbert modular surface Xd will be denoted by Pd- 
It is a union of modular curves, as described in the following proposition. 

Proposition 5.1. The decomposition into irreducible components of Pd is given 
by 

Pd = [J Pd,u = [J Pp r+VD , 

Tr(i^) — 1 r=D (mod 2), \r\<2y/D 

where Pd,v is the image in Xd of the curve (v, C . 


Proof. This is essentially Corollary 3.5 of [27], which states that 


Pd 


U 

N,r&, N>0, D=r^+4N 



r + v^\ 


(30) 


Since each N occurring is admissible (it is the norm of the primitive principal ideal 
generated by we have that Tm{v) = Fn{v) is irreducible for each v = 

and hence coincides with its subcurve Pd,u- D 


We would like to say a few words to explain where the equation D = r^ + AN 
in (30) comes from. A point of Pd corresponds to a product ExE' oi elliptic curves 
having real multiplication by 0, i.e., for which there is an endomorphism $ = ( “ ^ ) 
oi ExE' satisfying a quadratic equation of discriminant D over Z. Since for generic 
points the elliptic curves E and E' do not have complex multiplication, we have 
a G End(A) = Z, b G End(P') = Z, and AA' = A'A = N with N = deg A G N and 
hence <I>^ — (a + 5)$ + (ab — N) = 0, D = (a + b)^ — 4(ab — N) = {a — 6)^ + AN. We 
should also mention that the statement Pd Q special case 

D' = 1 oi the general statement that the intersection of two Humbert surfaces TCd 
and TCd' in the moduli space A 2 = ]Hl2/Sp(4, Z) is contained in the union of Tv 
with DD' = + 4A^ for some r € Z, IV G N. This statement is well known, and is 

given implicitly in the proof of Prop. XI.2.8, p. 215, of [39], but since we could not 
find a convenient reference and since the proof is easy, we give it here. We recall 
(cf. [39], Chapter IX) that the Humbert surface TCd is defined as the image in A 2 
of the union of the curves 

{( ) G EI 2 I an + 6 t 2 + era + fi(r| - nra) + e = O} (31) 

with (a, b, c, d, e) G Z®, b^ — Aac—Ade = D. If H is a fundamental discriminant, then 
TCd is irreducible and hence can be given by any one of the equations in (31). The 
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locus of products of elliptic curves in Ai is dfi, because the standard embedding 
(]HI/SL 2 (Z))^ —>• ]Hl 2 /Sp 4 (Z) is given by the equation T 2 = 0 in ]Hl 2 , which has 
the form (31) with {a,b,c,d,e) = (0,1,0, 0,0), 6 ^ — 4ac — 4de = 1. The Hilbert 
modular surface Xd can be identified with 'Kd, because if we write N(xuJi +yuj 2 ) = 

Ax'^ + Bxy + Cy'^ {A, B, C G Z, B'^ — 4:AC = D), then the map (24) is given by 

/n r2\ ^ / ujIzi+uj'^‘^Z2 UJ1UJ2Z1 + UJ^UJ^Z2\ 

VT-2 T3J \UJ1UJ2Z1 + OJ1UJ2Z2 OJ2Z1 + UJ2'^Z2 / ’ ^ 

which satisfies an equation of the form (31) with {a,b,c,d,e) = {C,—B,A,0,0), 
b'^ — Aac — Me = D. In general, to compute the intersection dfundf^i/ we substitute 
the expression in (32) into the an equation of the form (31) with 5^ —4ac—dde = D'. 
This gives the equation 

— dD Z\Z2 (cica^ -f buj\(jj2 cca^)^! ~\~ (uca^ T buj\UJ2 Z2 T e = 0 , 


which has the form (29) with 

N = —deD — N{au)\ + 6 wia ;2 + cw^) 


DD' - {2Aa + Bb + 2Ccf 
4 


as asserted. In the special case D' = 1, we recover the equation D = + AN and 

also see that we are on the component ) of F^, as claimed in (30), since 

it is easily seen that + 6 a;iW 2 + cw| = (mod O). 


5.3. Teichmiiller curves and Veech groups. A Teichmuller curve is an irre¬ 
ducible algebraic curve W in the moduli space Mg of curves of genus g which is 
a totally geodesic submanifold for the Teichmuller metric. Teichmiiller curves are 
generated by a pair consisting of a curve C and a non-zero holomorphic one-form 
w G iJ^(C', Such pairs are called flat surfaces. An introductory text to flat 

surfaces is the survey [47], for example. On the set of flat surfaces there is an 
action of GL(2,IR) and Teichmiiller curves are the projection to Mg of the orbit 
GL(2,]R) • {C,uj). The uniformizing group T such that W = H/r, called a Veech 
group, can be read off from the flat geometry of the pair {C, uj). Let Kr be the trace 
field of r and r = [ATr : Qj- Teichmuller curves with r = g are called algebraically 
primitive. Under the Torelli map, algebraically primitive Teichmiiller curves map to 
the locus of abelian varieties with real multiplication by K ([30] Theorem 2.6). In 
particular for g = 2 the universal covering of an algebraically primitive Teichmiiller 
curve defines a map 

{(po, v?) : H —>• 

equivariant with respect to the action of the Veech group (acting on the left in 
the obvious way and on the right via its embedding into SL(2, AT) ^ SL(2,]R)^). 
The geodesic definition of Teichmuller curves implies that po is a Mobius trans¬ 
formation. Moreover we may suppose po = id using appropriate choices in the 
universal covering map. Gonsequently, Teichmiiller curves define Hilbert modular 
embeddings in the above sense. ([30], Section 3.) 

The space of flat surfaces {C, oj) is naturally stratified by the number and mul¬ 
tiplicities of the zeros of w. In particular, for g = 2 we have two strata HM 2 ( 1 ,1) 
and HM 2 ( 2 ), corresponding to to having two distinct zeros or one double zero, re¬ 
spectively. For g = 2 we have the following classihcation for algebraically primitive 
Teichmuller curves ([23], [24], [31], [25]). 
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Theorem 5.2. There is only one Teichmuller curve in the stratum riM 2 (l,l), 
called the decagon curve. It lies in the Hilbert modular surface X^. 

The stratum rJM2(2) contains infinitely many algebraically primitive Teichmuller 
eurves Wo, each lying in a unique Hilbert modular surface. For each non-square 
discriminant D > 5 the Hilbert modular surface Xo contains exactly one Teich¬ 
muller curve if D ^ 1 (mod 8) and exactly two if D = \ (mod 8). 

The union Wo of the Teichmuller curves in Xo other than the decagon curve is 
the locus in M 2 of curves whose Jacobians have real multiplication by Oo and such 
that the eigendifferential on which Oo acts via the embedding K cM. has a double 
zero. 

The two components in the case D = 1 (mod 8) are distinguished by a spin 
invariant 6 G {0,1} and will be denoted by = ]HI/r|,, so that Wo = W^ U Wj^ 
in this case. The definition of the spin invariant is given in [24] and will not be 
repeated here, but in §9.2 we will be able to give a new and equivalent definition 
in terms of our description of Teichmiiller curves via theta functions. 

The Teichmuller curves in flM2(2) admit the following characterization, which 
is an adaptation of the criterion in [30], Theorem 5.3. Let Hi {i = 1 or 2) be the two 
natural foliations of a Hilbert modular surface Xo for which the i-th cooordinate 
is locally constant in the uniformization. 

Theorem 5.3. An algebraic curve W C Xo is a union of Teichmuller curves if 
and only if 

(i) W is disjoint from the reducible locus and 

(ii) W is everywhere transversal to Ti. 

Sketch of proof. If IT is a Teichmuller curve, then (i) and (ii) hold by definition 
and by the fact that we can use the first coordinate as a parameter, respectively. 

For the converse recall that over a Hilbert modular surface the relative first 
cohomology with coefficients in K splits into two eigenspaces, two local systems 
over K that we denote by L and L and that are interchanged by the Galois group 
of K. Consequently, over any curve in a Hilbert modular surface the cohomology 
splits in the same way. 

Condition (i) is equivalent to W being in the image of the locus of Jacobians 
with real multiplication under the Torelli map. To apply the criterion of [30], 
Theorem 5.3, we need to show that the Kodaira-Spencer map for L or L vanishes 
nowhere on W. Condition (ii) implies the non-vanishing of the Kodaira-Spencer 
map for the corresponding L in the interior of Xo, while at the cusps non-vanishing 
is automatic, by a local calculation as in [6], Proposition 2.2. □ 

One can generalize this setup using algebraic curves in Ag that are totally geo¬ 
desic for the Kobayashi metric. See [32] for a characterization of these Kobayashi 
geodesics. 

5.4. Twisted modular forms for Wo- The topology of Wo and the ratio A 2 
are completely determined, combining the work of several authors. We summarize 
the results and combine them with Theorem 3.2 to determine the dimension of the 
space of twisted modular forms. 
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Theorem 5.4. For any non-square discriminant D, the fundamental invariants of 
the eurves Wd are as follows. 

(i) The orbifold Euler characteristic ofWo equals 

x{Wd) = -IxiXn), (33) 

where Xjo is the Hilbert modular surfaee H^/SL(0^ © Od). 

(ii) The cusps ofWo are in bijeetion with standard quadratic forms of discriminant D 
(see Section 10). 

(Hi) For D = 5, the curve Wd has two elliptic fixed points, one of order two and 

one of order five. For D ^ 5, there are 62(1?) elliptic fixed points of order two 

on Wd and no other fixed points, where 62 (-D) is a sum of class numbers of 
imaginary quadratic orders (|34], Table 1). In particular for D = 1 mod 8, 
there are e 2 {D) = ^h{—4:D) elliptic fixed points of order two. 

(iv) The curves Wf) and Wf, are defined over Q{'/D) and are Galois conjugate. 

(v) The curves W^ and W}) are homeomorphic. 

(vi) For a torsion-free subgroup of the Veech group of any component Wf) ofWD the 
ratio X 2 of the degrees of the line bundles L and L equals 1/3. 

Proof. Statement (i) is the main result of [2], Theorem 1.1. Statement (ii) is implicit 
in [24] and explicit in [2], Theorem 8.7(5). Statement (iii) is the main result of [34]. 
Statement (iv) is Theorem 3.3(b) of [5] and (v) follows directly. Statement (vi) was 
shown in [2], Corollary 12.4, and with a different proof in [5], Corollary 2.4. □ 

We recall that the value of x(^d) is known, and is given for a fundamental 

discriminant D by 

x{Xd) = 2Cic(-l) = ^ E « (34) 

D=62+4ac 

(see [17]), and in general by a similar explicit formula. 

Every curve of genus two is hyperelliptic and consequently, —I is in the Veech 
group for every Teichmiiller curve in genus two. The dimension of the space of 
twisted modular forms can now be deduced from Theorem 3.2. 

Theorem 5.5. For D > 5 the space of twisted modular forms on Wd is zero 
for k -\- £ odd. For k -\- £ even and D ^ 1 mod 8 

dimMfey(r) = -^(k + ^'^x(WD)- 1—^^162(11), 

where {x} is the fractional part of x, and for each of the two components for D = 1 
mod 8 

dimMkxir) = -^(^+x(W^d) - ^ ^^}e2(T>), 

where x{^d) and 02 ( 0 ) is given in Theorem 5.). 

Proof. The first statement holds because the Veech group contains —I. Given the 
general dimension calculation in Theorem 3.2 and the Euler characteristic in (33) 
it remains to show that for all the fixed points x of order two the local contribution 
bx{k,£) is { ~ 4^^ }, not { ~ 4"^ } for some of them. 

Suppose that M = (“^) G SL(0/, © O44) is of order 4 and stabilizes z = 
(^1,2:2) G Then multiplication by the diagonal matrix with diagonal entries 



THETA DERIVATIVES 


27 


((czi + (d’Z 2 + defines a linear map J of that stabilizes the lattice 

£z from (22) (i.e. the corresponding abelian surface has complex multiplication 
by the ring generated by On and J). To show that is hence 

equivalent to showing that J~^ (or J) has two eigenspaces of dimension one, rather 
than a two-dimensional eigenspace. 

Mukamel ([34]) studies, along with his classification of fixed points of Teich- 
miiller curves, the locus M 2 (-D 8 ) of genus two surfaces with automorphism group 
containing the dihedral group of order 8. He shows that all the fixed points of 
order two on the Teichmiiller curves Wd lie on the intersection (in ^ 12 ) of the 
Hilbert modular surface Xo with M 2 (-D 8 )- The family of curves over 'X[ 2 {D^) is 
given by the hyperelliptic equation 

= {X‘^+ aX+ 1), aGC\{±2}. 

The automorphism of order four is J{X,Y) = and the eigendifferentials 

are dXjY Y XdXjY and dXjY — XdXjY^ which lie in the eigenspace for -|-1 and 
for —1 respectively. This proves the claim on the J-eigenspaces. □ 

5.5. Gauss-Manin connection and Picard-Fuchs equation. Here we explain 
why Teichmiiller curves give rise to twisted modular forms and how to obtain the 
differential equations we attached to them in Section 4 geometrically. For the mo¬ 
ment, let W be any curve in M 2 such that the corresponding family of Jacobians has 
real multiplication by an order in K. Then the vector bundle with fiber H^{C,C) 
over the point [C] G M 2 splits (over R, and in fact over iG), as in the proof of 
Theorem 5.3, into rank two subbundles L and L. This vector bundle also comes 
with a flat (Gauss-Manin) connection V. The bundles L and L come with holo- 
morphic subbundles L and £ respectively, whose fibers over X are the holomorphic 
one-forms on C that are eigenforms for the real multiplication. The bundles L and 
L naturally extend over the cusps IT \ IT, where the fibers are stable forms. We 
denote them by the same letters. (We recall that a form is called stable if in the 
limit as t —>■ toj where the genus 2 curve parametrized hy t ^ to degenerates to a 
curve of genus 0 with double points, the corresponding differential on the normal¬ 
ization of this curve has simple poles with opposite residues at the points that get 
identified.) 

Suppose for simplicity that IT is a rational curve with parameter t. If we choose 
sections uj{t) of £ and w(t) of £, then {uj{t),X{d/dt)uj{t),X{d/dt)'^uj{t)} are linearly 
dependent in cohomology. Concretely, this means that if L is the corresponding 
second order differential linear operator, a quadratic polynomial in d/dt, then the 
image of uj{t) under L is exact. Similarly, {uj{t),V{d/dt)uj{t),V{d/dt)‘^uj{t)} are 
linearly dependent and give a second order differential operator L that makes u){t) 
exact. It follows that the periods, defined as the integral of w(t) and w(t) over any 
fixed element of Hi {C, C) are annihilated by L and L respectively. (Here “fixed” 
means that we use the property of being a local system to identify the homology 
groups Hi{Ct,C) = iJi(C't,Z) 0 C with each other locally.) These are the well- 
known Picard-Fuchs differential equations satisfied by periods. 

Now assume that IT is a Teichmiiller curve in M 2 or, more generally, with 
quadratic trace field. We show that the periods just described are (twisted) modular 
forms of weight (1,0) and (0,1) respectively, with respect to a modular embedding 
Lp as defined in Section I. More precisely we have the following correspondence. 
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Proposition 5.6. Suppose that W is a Teichmiiller curve with uniformization 
H/r as above, and let L and L be the rank two differential operators associated with 
sections u}{t) and ui{f) of L and L as above. Then there is a rank-one submodule (in 
the rank-two 0-^(W)-module of solutions of L) consisting of holomorphic modular 
forms of weight (1,0), and a rank-one submodule (in the rank-two Oi^(W)-module 
of solutions of L) consisting of twisted holomorphic modular forms of weight (0,1). 
Ifujft) (resp. dj{t)) extends to a .stable form over a cusp ofW, then the corresponding 
(twisted) modular form is holomorphic at this cusp. 

This defines a 1:1 correspondence between holomorphic sections of L over W and 
holomorphic twisted modular forms on T of weight (1,0), and a 1:1 correspondence 
between holomorphic sections of L over W and holomorphic twisted modular forms 
on r of weight (0,1). 

Proof. In [30] it was shown that there exists an oriented basis (3, a of the kernel 
of w in Hi{C,]S.) such that the monodromy representation of Tri(W) = F on that 
subspace is the identity, and similarly a basis a®’ of the kernel of uj with respect 
to which the monodromy representation is given by the Galois conjugate group F®’. 

Consequently, the period map z i-A J^U!{z)j f^oj(z) is equivariant with F acting 
on domain and range, hence the identity after an appropriate conjugation by a 
Mobius transformation. Moreover, the period map z i-A w(z)/ w(z) is equi¬ 
variant with F acting on the domain and F*^ on the range. Hence this map agrees 
with ip in the definition of the modular embedding by the uniqueness of modular 
embeddings. 

As said above, the periods /(z) = f^oj{z) and /i(z) = f^Lj(z) = zf{z) span the 
space of solutions of L (pulled back to H via f). The statement above about the 
monodromy implies that for all 7 = (“^) GF 

(^ (fiiz)\ ^ ( a/i(z) -b bf{z)\ ^f{az + b)f{z)\ 

\f{iz)) ^ \f{z) J \cfi{z) + df{z)) \{cz + cl)flz)J ' 

The second row implies that / is a twisted modular form for F of weight (1,0). 
Similarly, the periods /(z) = w(z) and /i(z) = cv(z) = p(z)f(z) span the 
space of solutions of L and we have 

+ ^ f(aMz) + bnl(z)] 

[f(7^)J [f(z)J [c<^fi(z) + d<^f(z)J [(c<^p(z) + dnf(z)J ' 

Again, the second row implies that / is a twisted modular form for F of weight 

(0,1). 

Holomorphicity of / and / in the interior of HI is obvious by the dehnition of a 
period. To show that they are holomorphic at the cusps, we may assume without 
loss of generality that zq = 00 and tg = 0. There, it follows from the definition of 
the monodromy representation that a (resp. a'^) is characterized in (w)"*" (resp. in 
(a;)'*") as the elements invariant under the local monodromy group. The period of 
a stable form along such a cycle is finite. 

To establish the last statement of the proposition, we just need to assign to every 
holomorphic twisted modular form of weight ( 1 , 0 ) (resp. weight ( 0 , 1 )) a section of 
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L (resp. of £). This is well-known in the untwisted case and was done in both the 
untwisted and twisted cases in Section 4 of Part 1. □ 

We end with a remark on Galois conjugation and spin. We defined P to be 
monodromy group of the local system L. Then, of course, the monodromy group 
of the Galois conjugate L is P'^. We will see in the next section in an example, and 
at the end of the paper in general, that the solutions y and y also have coefficients 
in the field K. However, the Galois conjugate solution y'^ is neither equal to y 
nor to any other solution of L. In fact, y'^ is naturally a solution of a differential 
operator associated with the Galois conjugate Teichmiiller curve . For D = \ 
mod 8 this is the curve with the other spin invariant (see Theorem 5.4 iv)). For 
D ^ 1 mod 8 this Galois conjugate curve is isomorphic to the original curve by 
McMullen’s classification recalled in Theorem 5.2. The equation of this curve for 
H = 13 is given explicitly in [5]. 


6. Example: the curve and its associated differential equations 

Our running example, from now until the end of Part II, will be the Teich- 
miiller curve on the Hilbert modular surface Xn. In this section we gather 
the known results for this curve, computing the Veech group and summarizing 
the construction from [5] to compute the equation of the universal family and the 
corresponding Picard-Fuchs differential equations. 


6.1. The Veech group for D = 17 and spin 1. For small values of D the Veech 
groups rfj = r(IF|,) can be calculated using the algorithm in [23]. This is sufficient 
for our purposes, but we emphasize that a general algorithm to compute the Veech 
group for any Veech surface has been developed and implemented by Mukamel [35]. 
We describe this in detail for the case D = 17, 6 = 1, i.e., for the Teichmiiller curve 
of non-trivial spin. McMullen’s algorithm gives a subgroup of SL(2,0 d), so the 
group that we will get (which we will denote simply by F, or by ri7 when needed 
for clarity, with quotient W = H/F) is actually the conjugate AFjyA”^ of Fjy by 
A = (p ^) , and for the same reason the function ip used to make the modular 
embedding will go from El to El” rather than from El to H. Later, when we use this 
modular embedding explicitly to compare the twisted modular forms on W with 
standard Hilbert modular forms for Q(vT7), we will conjugate back to make the 
comparison easier. 

We denote by a = {l + ^/vf)/2 the standard generator of 0 = O17 over Z, and for 
ease of reading will sometimes use the abbreviated notation [m, n] for m + na G 0. 
The group F can be embedded into SL(2, R) by the standard embedding of 0 into R 
and then acts discretely. (Note that the other embedding of 0 into R would lead 
to a non-discrete subgroup of SL(2,R)!) A fundamental domain for this action 
is shown in Figure I (b), while Figure 1 (a) shows the explicit “L-shaped region” 
needed to apply the algorithm and obtain the fundamental domain. The group F 
has three cusps, at z = 00, I, and a/2, and an elliptic fixed point of order 2 at z = i, 
where z is the coordinate in El. The stabilizers of the cusps are the infinite cyclic 
groups generated by the parabolic elements 


Moo 


1 a 
0 1 


Ml 


/ —2q: — 2 2o: -j- 3\ ^ f —2o: — 3 3 q: -t- 4\ 

\^-2a - 3 2a + l) ^ = y_2a _ 2 2a + 5) ’ 
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Figure 1 . (a) Flat surface generating the Teichmiiller curve W17 
when a = (1 + \/T7)/2. The square is the double zero of w, the 
black points are the remaining 5 Weierstrass points. 

(b) A fundamental domain for F. 

and the stabilizer of i is generated by the element Mi = (q ). The presentation 
of the group F given by McMullen’s algorithm is then 

F = (Moo , Mi, Ml, M„/2 I Mf = -1, MooMiMiM„/2 = l) . 


It will be useful in the sequel to deal not only with the Fuchsian group F but 
also with a certain index 4 subgroup 11 of it, already mentioned in §3. This group is 
more convenient for purposes of calculation because it is free and also because the 
universal genus 2 curve over H/II has a stable model. We want that FI = 11 x {±1} 
has index two in F. (This already implies that 11 has no torsion and is thus free.) 
This group is not unique. We fix the choice 

n = (Moo, M^/2, Ml -M-^MooMi). 

The group 11 = Fix {±1} is the kernel of the homomorphism from F —>■ {±1} 
sending Mqo and Mq ,/2 to 1 and Mi and Mi to —1. 

Both curves H/F and H/II have genus 0, so there are modular functions s(z) 
and t{z) on F and FI giving isomorphism between their compactifications and P^(C) 
(hauptmodules). We can normalize them so that the involution induced by Mi : 
z !->■ —l/z corresponds to t >->• l/t and that the three cusps of H/II are at 0, I 
and 00 . Then the quotient map 1 1 —s is given by 


4 Ko t 

" = "(T^’ 


where kq 


-895 + 217^17 
256 


(35) 


and the values of s and t at the cusps and elliptic fixed points are given, according 
to the calculations in [5] (where a different parameter on H/II was used), by the 
table 



Z 

00 

0 

1 a/2 

—2/a i 

n 

t = t{z) 

0 

00 

1 A-" 

A 

-1 

F 

s = s(z) 

0 

0 

00 1 

1 

KO 


( 36 ) 
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where 




31 - 7^17 


Ko 


= s{i) = - 


(x-iy 


(37) 


2 ’ - V / 4 ;^ 

For later use we emphasize that both t and s are local parameters of the Teich- 
miiller curve at the cusp z = 00 . The whole situation is summarized by the following 
diagram. 


ITn = H/n^^pi(C) 


Ty = Ei/r ^ > pi(c) 

6.2. The universal family over W. The modular curve W = H/T parametrizes 
a family of genus 2 curves with real multiplication by 0 = Z+Za on their Jacobians. 
This family, and its associated Picard-Fuchs differential equations, was determined 
explicitly in [5]. In this subsection we review these results, and also give somewhat 
simpler equations by making suitable changes of coordinates. We remark that 
meanwhile equations of more Teichmiiller curves have been computed by a different 
method by Kumar and Mukamel ([22]). 

The explicit equation in [5] was actually given for the family over the double 
cover H/n of W, with the parameter t, and has the form 

= F 5 (X,t) = (X + (At + B)) X (X + (Bt + A)) x 
X (X^ + C(t + 1)X^ + (B(t + If + Et)X + F{t + If + Gt{t + 1)) , 
with coefficients A,... ,G given (with our above notation [m, n] = m + na) by 

A = 5[2,l], B = -2[5,3], C = [3, 1 ], D =[827,529], 

4 (39) 

S = 2"^ • 17[3,2], F’= --[4597,2943], G = 2 • 17 • [271,173]. 

(These coefficients are not quite as bad as they look since they all factor into small 
prime factors, e.g. D = —Tr^^/de^ and F = — 7 r 2 ^e ^/2 where e = 4 + vTz is the 
fundamental unit of Q(vT7) and 712 = (3 + vT7)/2 one of the prime factors of 2.) 
We explain briefly how this equation is derived. 

We can represent the fiber Gt over t as a hyperelliptic curve = Be {X, t ), where 
Pq is a polynomial of degree 6 in X whose roots correspond to the Weierstrass points 
of Gt- From the action of the subgroup 11 C F in the Veech groups on these points 
(which can be analyzed by looking at the Figure lA, in which the Weierstrass 
points are indicated by black points), we see that they break up into three orbits 
of size 1 and one of size 3, with two of the 1-element orbits being interchanged by 
the symmetry t 1 —>■ . Placing the other 1-element orbit, the singularity of w, at 
infinity, we get a new equation of the form (^x, t) where P 5 factors into 

two linear and one cubic polynomial, and by degree computations together with the 
symmetry under t ^ we find that these factors must have the form given in (38) 
for some constants A,..., G. They are not unique, since we can make a change of 
variables X >->• aX + jSf+l), but become unique up to scaling if we assume that the 
two eigendifferentials u) = dvi and w = dv 2 are dXjY and X dXjY, respectively. To 
determine them, we note that at each cusp t = c G {0, 1, A} this polynomial must 
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acquire two double roots (the degeneracy of the genus 2 curve at infinity cannot 
consist of just two roots coming together, because the real multiplication forces the 
subspace of Hi{C; Z) that collapses to be an 0-module and hence to have even rank 
over Z), so we have P^{X, c) = {X — Xq){X — X^)'^{X — (The corresponding 

factorizations at the two other cusps t = oo and are then automatic because 
of the t ^ symmetry.) These conditions do not yet suffice to determine the 
equation, but there is one further condition at each cusp. This comes from the fact 
that a singular curve of the form = {X — Xo)(X — Xi)'^{X — ^ 2 )^ has genus 0. 
A parametrization with parameter T is given by A = (Ai — Xo)T^ + Xg and 
Y = {Xi- Xo)^/‘^T{T^ -1){T^ - p"^), where = {X 2 -Xo)/{Xi-X q) denotes the 
cross-ratio of Xq, Xi, X 2 and 00 . The differential form dXjY corresponds under 
this map to a multiple of the differential form — 7 ^ -b ^ 3 ^ — dT on 

having four simple poles with residues summing to 0 in pairs, and with the ratio 
of the non-paired residues being ±p. But for the cusps of the Teichmiiller curves 
we know a priori that these ratios of residues must equal the ratio of the top and 
bottom sides of the L-shaped region as it degenerates. From the horizontal sides 
Figure lA we read off the value po = Redrawing this figure decomposed into 
cylinders in the direction of slope one (corresponding to the cusp t = 1) and slope 
a/2 (corresponding to the cusp t = A by the table in the preceding subsection) we 
find Pi = a/2, = (1 + ot)l2. This information now suffices to determine all of 

the unknown coefficients, up to the ambiguity already mentioned (in particular the 
second eigendifferential form u) = XdXjY automatically has the correct ratio of 
residues, namely, the Galois conjugates of the ones for w, so that there are no extra 
restrictions on the coefficients coming from this condition), and carrying out the 
calculation we find the values given in (39). 

We remark that equation (38) can be simplified considerably by substituting 
(1 -b t){l -b AvT 7)/4 for X, in which case Pb{X, t), up to a factor (•\/T7(l -b t)/Y)^, 
takes on the much simpler form 

F 5 (A, u) = {{X - 1)2 - [4, 5] u) {{X + VT7){X + 1)2 - 8u{2X + [9,5])) 

with u = Ae = 1 -s/k.q ■ This gives an explicit and relatively simple equation 

for the family of genus 2 curves over the Teichmiiller curve H/F. 

6.3. The Picard-Puchs equations for W and their solntions. As already dis¬ 
cussed in Section 3, even though we are considering only the single curve W = H/F, 
there are two Picard-Fuchs differential equations, corresponding to the variation of 
the periods of the two eigendifferentials oj and ui for the action of 0 on the space 
of holomorphic differentials of the fibers. It will be crucial for our calculations to 
have both of them, since together they will tell us explicitly how the Teichmiiller 
curve W is embedded in the Hilbert modular surface A 17 . 

Obtaining the Picard-Fuchs differential equations satisfied by the periods of the 
two eigendifferentials to and w is straightforward once the equation of the family of 
curves has been obtained. One has to find differential operators L and L mapping 
the one-forms oj and w to exact forms. The result, given in [5], is a pair of differential 
operators of the same form as in ( 20 ), namely 


(40) 
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where A{t) and B{t) are the polynomials given by 


A{t) = X){t - A"^) = + pf -t, 

-SW = I (3^^ - (/3 + 7)i + 7 ) , 

with A = (31 — as in (37) and P and 7 defined by 

^ , 1 ^ 1087-217717 27-5717 

'5 = " + ^ +' =-M-• ^ = • 

and where A{t) and B(t) are the rational functions 


A{t) 

Bit) 


N ! ( 1 137-95717 

Ait)j (7 + - t + 1 


(i 


128 

^4 ^ 1113-399717 ^3 _ 


512 


1387-301717 23-57T7N 


128 


t + 


260375 - 69633717 ^2 
16384 ^ 

137-95717 


-)li 


7 + 


128 


^ + 1 


(41) 


(42) 


(43) 


The differential operator L has five singularities, at infinity and at the roots of 
Ait). The differential operator L has seven singularities, these five and two more at 
the poles of A, but these last two are only apparent singularities of the differential 
equation, i.e., all solutions of the equation are holomorphic at these points. 

The unique solutions in 1 + tC[[t]] of the differential equations Ly = 0 and 
Ly = 0 can easily be calculated recursively. The first few terms are given by 


_ 1 I 81-15717 f I 4845-1155717 j.2 , 3200225-775495717 ^3 , 

y -i“r 16 "*" 64 '-“T 2048 "I" • • • 

« 1 + 1.197t + 1.294 7 + 1.356 7 + 1.402 7 + 1.439 7 + ••• 

~ _ 1 I 23-bVT7 j. I 5561-1343v^ ±2 , 452759-109793v^ ±3 , 

y — 1 -t- g t -h ^2S ^ “T 53^2 i -t • ’ ’ 

« 1 + 0.2981t + 0.1849 7 + 0.1384 7 + 0.1131 7 + 0.09737 + ••• 


There are also unique power series yi and yi without constant term such that 

y log(t) + yi and y log(t) + yi are solutions of the same differential equations as y 

and y, respectively. These series begin 

_ 439-97717 f 563089-135575717 ^2 , 200641639-48642353717 ^3 , 

yl “ 64 4096 '' 65536 '' 

« 0.6103t + 1.0017 + 1.2837 + 1.5047 + 1.6877 + ••• 

~ _ 1575-369v^ f I 1749337-423695v^ ±2 , 1764480419-427927381v^ ±3 , 

128 ^ 8192 ^ 393216 ^ "T * ' * 

« 0.41851 + 0.29277 + 0.2305 7 + 0.1958 7 + 0.1748 7 + ••• 


We have given the numerical values of the first coefficients of each of these 
four power series to emphasize that they are quite small (and the same is true of 
the first few hundred, which we have computed). In fact, the coefficients in each 
case grow like A", where A = 1.06913- • • is the number defined by (37), since the 
radius of convergence is the absolute value of the nearest singularity t ^ 0, and 
the singularities are at t = 0, A“^, 1, A and oo. The growth in each case is quite 
regular, with the coefficient of t" being asymptotic to a constant times A"/n. It is 
perhaps worth mentioning that if we took the Galois conjugates L°' and L'^ of the 
differential operators L and L, which give the Picard-Fuchs equations for the other 
Teichmiiller curve IF{y (cf. [5] or §9), then the power series y,...,yi would also 
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be replaced by their Galois conjugates and would look algebraically very similar to 
those above, but would have completely different real coefficients and growth, e.g., 
the expansion of y'^ begins 

„,<T _ -1 I 81 + 15^17 i I 4845+1155vdT ^2 , 3200225+775495^17 f3 , 

i/ — 1 4- 10 <-4- 04 I- -I- 2048 '• 4- • • ■ 

« 1 + 8.9281 + 150.11^2 + 3123.9+ 71667+ 1738907+ ••• , 

now with coefficients growing like (A'^)”/n with = 29.93086 • • •. The corre¬ 
sponding Fuchsian group, although isomorphic to 11 as an abstract group, is not 
conjugate to it in SL(2,]R), and the quotients of the upper half-plane by these two 
groups, which are the curves and VF^, represent different points of the moduli 
space Mo, 5 . 

Another very striking property of the expansions of y and y given above (and 
then of course also of their conjugates y'^ and y'^) is that the only denominators one 
sees are powers of 2 , i.e., the first few coefficients of these power series all belong to 
the ring 0 ( 5 ]. A calculation to higher accuracy shows that the same holds for the 
first few hundred coefficients, and in fact it is a theorem, proved in [5], that it holds 
for all coefficients. We will return to this question at the end of the next section 
because it is has a very interesting aspect that was in fact the point of departure 
for our whole investigation. 

7. Arithmetic properties of modular forms for 

With the preparations in the preceding sections we can now compute in §7.1 
the modular embedding ip in the example D = 17. In the process we compute the 
Fourier expansions of some modular forms and later, in §7.3 we completely deter¬ 
mine the ring of twisted modular forms in this specific example. This arithmetic of 
the coefficients reveals two surprising phenomena, a transcendental constant needed 
for the correct choice of the g-parameter and the integrality statement mentioned 
above and proved in [5], which cannot be explained using one modular g-variable. 
We discuss these in §7.2, and provide the explanations in Section 8 . 

7.1. Modular parametrization of the differential equations. We have al¬ 
ready mentioned that the differential equations Ly = 0 and Ly = 0 have the same 
form as the differential equation ( 20 ) satisfied by ordinary or twisted modular forms 
with respect to a hauptmodule. This is of course not a coincidence: we have 

= fiz), y{t{z)) = I{z) for Q(z) large, |t(z)| small 

where t : H/II —>■ C is the map defined in §6.1 and f{z) and f{z) are a modular 
form of weight 1 and a twisted modular form of weight ( 0 , 1 ), respectively, on 
the same group 11. In this subsection we will work out this statement in more 
detail, obtaining in particular a way to calculate the expansion (5) of the function 
(/? : H —>■ HI” whose graph gives the embedding of H/II into H x H“/SL(2, O 17 ) as 
discussed in Section I. (Here we need H“, rather than H as in Section I, because 
we have conjugated the original Veech group by A = (g yjy) to embed it into 
SL(2, Od) and the Galois conjugate of A has negative determinant.) We will also 
calculate the g-expansion of f(z), obtaining our first explicit example of a twisted 
modular form. In Section 8 we will use this information to determine completely 
the rings of twisted modular forms for F and H and the algebraic description of the 
Teichmiiller curve W inside the Hilbert modular surface A 17 . 
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We begin with the functions y and /. As stated in §6.1, the cusp at infinity 
for either F or 11 has width a, i.e., its stabilizer is generated by the transformation 
z z + a, where a = (l + vT7)/2 is our standard generator of On, so any modular 
function or modular form on either group can be written as a power series in the 
variable q = . On the other hand, we know that the space of solutions of the 

differential equation satisfied by any weight 1 modular form f{z) with respect to 
any modular function on the same group is spanned by f(z) and zf{z). Our first 
thought is thus that q coincides with the “mirror parameter” 

Q = Q{t) = ^exp(yi/y), (44) 


where y = y(t) and yi = yi{t) are the two power series in t defined in §6.3. This is 
indeed what happens in the case of the Apery or Apery-like differential equations 
(see [46]), at least if one normalizes the hauptmodule correctly. Here, however, it 
is not quite true. We can see this numerically as follows. The function Q{t) has a 
Taylor expansion beginning 


Q(i) = t + 439-97vT7 ^2 


249125-60195vT7 

2048 




(45) 


We can invert this power series to obtain 


t = t{Q) 


Q 


439-97vT 7 ^2 , 103549-24971vT7 ^3 , 
64 V V 2048 ^ 


(46) 


and then substitute this into the expansion of f{z) = y{t) to express f{z) as a 
power series 


f{^\ _ 1 I 81-15^17 ^ I 8613-2019^17 /->2 , 726937-175823^17 /->3 , 

j \^Z) — l .g V "T 512 V -h 16384 h? "r 


« 1 -k 1.197 g -k 0.563 g2 -k 0.122 -k 0.0082 - 0.0011 Q® - • • • 


(47) 


in the new local parameter Q at infinity. Looking at the first few numerical coeffi¬ 
cients in this expansion, we see that they seem to be tending to 0 rapidly, suggesting 
that the radius of convergence of this power series is larger than 1, which is the 
value it would have to have if we were expanding with respect to q. The point is 
that, although the function logQ = log! + yi/y has the same behavior at infinity 
as 2Triz/a, namely, that it is well-defined up to an integer multiple of 2Tri, this 
property determines it only up to an additive constant. Therefore q and Q are 
related by 

Q = Aq = (48) 

for some constant A ^ 0 that has no reason to be equal to 1. The radius of 
convergence of the series in (47) is then equal to the absolute value of this constant. 

We can use this idea, or a modification of it, to calculate A numerically. First, by 
computing a few hundred coefficients of the series in (47) and calculating its radius 
of convergence by the standard formula R = liminf where an denotes 

the nth coefficient, we find that |A| is roughly equal to 7.5. However, this direct 
approach has very poor convergence (because the coefficients of the expansion of 
f{z) in g, unlike those of the same function when written as a power series y{t) 
in t, do not behave in a regular way), and anyway gives only the absolute value 
of the scaling constant A. To find the actual value to high precision, we apply a 
simple trick. From the data in the table (36), we know that the value of t{z) at 
z = i equals —1 and that this value is taken on with multiplicity 1 (because t is a 
hauptmodule for a group with no elliptic fixed points) and is not taken on at any 
point in the upper half-plane with imaginary part bigger than 1 (because i and its 
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translates by multiples of a are the highest points in the Il-orbit of i). It follows 
that the function + 1) has a simple pole at z = i and that if we express this 

function as a power series in q (resp. Q), then its singularity nearest the origin is a 
simple pole at <70 = (resp. Qo = Aq^). In other words, 1/(1+ t) is the sum of 

c/(l — Q/Qq) for some non-zero constant c and a function holomorphic in a disc of 
radius strictly larger than jQol- This implies that if we expand 1/(1 -|-t) = ^ 
then the coefficients are given by = cQq^(1 + 0(a“”)) for some a > 1, and 
hence that the quotients &„+i/6„ tend to Qq with exponential rapidity. Calculating 
a few hundred of the coefficients numerically, we find from this the value 

A « -7.48370822991173536914114556623211 


to very high precision. After some trial and error we can recognize this number “in 
closed form” as 

and we will see later that this guessed value is indeed the correct one. 

Equations (46)-(49) now give as many terms as desired of the g-expansions of 
the modular function t(z) and modular form f(z). We can (and of course did) then 
use this to check the correctness of these equations numerically to high accuracy 
by verifying the invariance of t(z), and the invariance of f{z) up to an automorphy 
factor cz + d, under modular transformations z 1 —(az + b)/(cz + d) in the group 11. 
Similarly, by inverting (45) we can also give the inverse of the uniformizing map 
H —>• Wn explicitly as 


z{t) 


a Q 

Qi /, i 


439-97^/T7 f I 321913-77807v^ ^2 
64 ^ 4096 ^ 


(50) 


Exactly the same considerations apply to the second differential operator L, with 
the difference that here the mirror parameter 


Q 


tgVi/v = t I 1575-369vA7 ^2 
128 


4814915-1166773vA7 
16384 


+ ••• 


is related to the variable z in the upper half-plane by 
Q = Aq with q = 


where : El —>■ El is the twisting map and A is some constant. A calculation like 
the one for A gives the numerical value 


A « -40.9565407890298922716044572957685, 


which we can recognize as the “conjugate-in-the-exponent” of the value in (49): 

~ ? /- /5 - v77\(-vA7-1)/4 

A = -2(3 + yi7)(-. (51) 

We will show later that also this formula is indeed correct. 

We can now calculate the Eourier expansions of both the twisted modular form 
/(z) = 2/1 (t) and the twisting map : El —>• EI“ as 
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and 


ip{z) 


a" Q a" A 

= 271 


27ri 

a' 


t H” 


2/1W 


y{t) - log A' 


/loo- A -1- 1575-369^17 ^ , 1208617-292799VT7 ^2 , 

2 ^^ ^ 128 ^ 


8192 


697-175VTf 

128 


'•) 


dig 


I -29767+7249v^ a2 2 , 3091637-749587v^ 43 „3 , \ 

8192 q 393216 ^ "h • • • ^ . 


(52) 


Again these Fourier expansions, unlike the expansions of the same functions as 
power series in t{z), converge exponentially rapidly for all z in the upper half¬ 
plane and can be used to compute the functions f{z) and Lp{z) numerically and to 
verify the modular transformation properties ( 1 ) and = {c'^(p{z) + d'^) f{z) 

numerically to high accuracy, giving us the first explicit example of a non-classical 
twisted modular form on a Teichmiiller curve. 


7.2. Modularity and integrality. At the end of §6.3 we stated that all the co¬ 
efficients of the expansions of y and y as power series in t belong to the ring 0 [i]. 
This integrality has a rather puzzling aspect, which we discuss here and resolve 
in §8.3. 

If we write y as X) Cn/”) then the differential equation Ly = 0 translates into the 
recursion 

(n -b l)^c„+i = (/3(n^ +n) + fy) c„ - (/3(n^ - i) + jl) Cn-i + (n - 5 )^c „_2 

for the coefficients c„, where /3 and 7 are given by (42). The integrality (away 
from 2 ) of the Cn is far from automatic from this recursion, because at each stage 
one has to divide a linear combination of previous coefficients by (n -I- 1)^, so that 
a priori one would only expect n!^c„ to be 2-integral. Divisibility properties of this 
type are familiar from well-known recursions like the recursion 

(n-l-l)^A„+i = (lln^-b lln-b 3) A„ — A„_i (53) 

used by Apery in his famous proof of the irrationality of C(2), or the similar one 
he used in his even more famous proof of the irrationality of C(3). However, they 
are extremely rare. For instance, in [46] it was found that of the first 100,000,000 
members of the three-parameter family of recursions obtained by varying the co¬ 
efficients “11,” “3” and “— 1 ” in (53), only 7 (if one excluded certain degenerate 
families, and up to scaling) had integral solutions. 

Apery proved the integrality of the solution of his recursion (53) by giving the 
explicit closed formula A„ = X)fc=o (fe)^ (”n^) ■ kiiow a corresponding ex¬ 

pression in our case. However, soon after Apery’s original proof, a more conceptual 
explanation was found by Beukers [4], who saw that the differential equation cor¬ 
responding to Apery’s recursion has a modular parametrization y = f(z) G Mi(r), 
t = t{z) G M“®‘'(r) of the type discussed in §4, the group F in this case being Fi (5), 
and this implies the integrality because we have f{z) G Z[[g]], t{z) G g -b g^Z[[g]] 
and hence y G Z[[t]]. Similar statements hold for all seven of the “Apery-like” 
equations mentioned above, leading to the conjecture (which was made explicitly 
in [46]) that the integrality property for recursions of this type occurs precisely 
when the corresponding differential equation has a modular parametrization. 
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The surprise is now this. In our case, just as in the seven “Apery-like” ones, the 
differential equation (at least for y) is modular, and does have power series solutions 
with integral coefficients (away from 2), but now the modularity does not explain 
the integrality in the same way as above, because here the relevant Fuchsian group 
is not arithmetic and the g-expansions of t and y are not integral. Indeed, as we 
saw in §7.1, the coefficients of these (/-expansions are not even algebraic numbers, 
since they involve powers of the scaling constant A, which according to (49) and 
Gelfond’s theorem is a transcendental number. But even if we rescale by replacing 
q hy Q = Aq, then, although the first few coefficients as listed in equations (46) 
and (47) have denominators that are powers of 2, this property fails if we compute 
more coefficients. For example, the coefficient of in f(z) equals 

16063132006911958155776129 - 3895881761337356780171815^17 

253. 33 .5 • 7 > 

and calculating further we find that the first 100 coefficients contain in their de¬ 
nominators all primes less than 100 that do not split in Q(vT7), and similarly 
for t{z). Thus, although our differential equation (40) does not actually contradict 
the hypothetical statement 

“integrality occurs only when the differential equation is modular” 
mentioned above, the mechanism 

y{t) modular y and t both have integral (/-expansions 
y has an integral t-expansion 

which previously explained that statement now breaks down completely. This puz¬ 
zle, which was in fact the original motivation for the investigation described in this 
paper, will be solved in Section 8, where we will provide a purely modular explana¬ 
tion of the integrality property by expanding y and y with respect to both q and q, 
using Hilbert modular forms rather than modular forms in one variable. 

7.3. The ring of twisted modular forms for W and Hff. We can now calculate 
the rings and M*_*(n, </?) of twisted modular forms on the Teichmiiller 

curve W = H/F and its double cover Wn = H/H. This information will be used in 
the following section to embed the curve IF^^y into the Hilbert modular surface A 17 . 

We already know two twisted modular forms on H, namely f{z) = y{t{z)) in 
Mi^o(n) and f{z) = y{t{z)) in Mo,i(n). (From now on we omit the ‘V”.) Any 
holomorphic or meromorphic twisted modular form of weight (fc, £) on H is then 
equal to times a rational function of t = t{z). The next proposition tells us 
which ones are holomorphic. 

Proposition 7.1. For k, £>0 the vector space of twisted modular forms of weight 
{k, £) on H is given by 

(n) = ( f{zf]{z)^ t{zr \3k + £>2c>0)^. (54) 

Proof. The group H has no elliptic fixed points and five cusps, one of which is 
irregular, so Theorem 5.5 implies the dimension formula 


dimM(fc_^)(n) = 1 + 


3k+ £ 
2 


(55) 
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Since the right-hand side of this equals the number of monomials in (54), it 

suffices to prove that each of these monomials is holomorphic or equivalently, that 
f{z) and f{z) are holomorphic everywhere (including at the cusps) and vanish to 
orders 1/2 and 3/2, respectively, at the cusp t = co, where the order is measured 
with respect to the local parameter 1/t. The holomorphy at the cusps is a special 
case of Proposition 5.6, since the construction of the defining equation (38) of W 
given in §6.2 was based on choosing the coefficients in such a way as to make the 
two differential forms uj = dXjY and tu = X dXjY stable at all of the cusps of W. 
We therefore only need to check that the order of vanishing of / and f at z = 0 
(corresponding to t = oo) are at most, and hence exactly, equal to 3/2 and 1/2, 
respectively. We will give two arguments to see this. 

The first way is to use the action of the element S = Mi of T, which corresponds 
to the involution t i—>■ 1/t on the base of the family (38). This involution extends 
via X I—>■ X/t and Y i—>■ Y to an involution i of the whole family, with = 

and L*uj{t) = Near t = 0, the section / is the period ofw along the 

unique cycle (up to scale) /3o that is orthogonal to uj{t) and extends across t = 0. 
Near t = oo, it is the period of w along the unique cycle (up to scale) Poo that 
is orthogonal to w(t) and extends across t = oo. From these defining properties 
it follows that i*Po is proportional to Poo- The same argument applies for /. In 
modular terms, this translates into the statement that the function f(z) transforms 
via 

;/(-;) = (56) 

for some appropriate choice of the square-root of t{z)^^‘^ (which we need only make 
at one point since this function has no zeros or poles in H), and similarly 

(57) 

Equations (56) and (57) clearly imply the statement that the modular forms f'^t^ 
and at oo are holomorphic everywhere, as claimed. 

The other approach, not using the accidental fact that the cusps 0 and oo of H/II 
happen to be interchanged by an element in the normalizer of If and therefore 
applicable in other situations, is based on the equation 

^t'(z) = f{zfA{t{z)), (58) 

where A{t) is the 4th degree polynomial given in (41), see (19). Since the poly¬ 
nomial A(t) is divisible by t, we find that f{z)‘^ multiplied by a cubic polynomial 
in t{z) is equal to the logarithmic derivative of the modular function t{z), and hence 
is holomorphic at the cusp t = oo, so the order of / at t = oo is < 3/2, as desired. A 
similar argument, this time using (21), applies also to /, with the polynomial A(t) 
replaced by the rational function A{t), which grows like as < —>■ oo. □ 

Remarks, i) We make some comments about the half-integer order of / and / 
at oo and about the appearance of the function ^/t{z) in (56) and (57). By the 
discussion in §6.1, the image II of 11 in T = r/{±l} has index 2 and is hence 
normal, so S'nS'“^ = Ft and hence F\kS must be a modular form on 11 for any 
modular form F (twisted or not) of even weight k on 11. But the subgroup 11 of T is 
not normal and is not normalized by S, so the intersection of 11 with 11' = S'nS'“^ 
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is a proper subgroup (of index 2) in F. The space of holomorphic modular forms 
of weight 1 on this group is 4-dimensional, spanned by the functions with 

0 < J < 3, with the spaces of modular forms of weight 1 on If and 11' separately 
being spanned by (/,/t) and by respectively. Notice that the group 

n n n' does not contain the stabilizer Mao of oo in F (or If), but only its square, 
so that it has width 2 and hence a local uniformizer at oo. This group has 
genus 0, with as a hauptmodule. 

ii) In the above proof we gave an implicit estimate of the period integrals that 
dehne / and / in the neighborhood of any cusp. These periods are given by inte¬ 
grating uj = dX/Y and w = XdXjY over a linear combination of paths that are 
invariant under the local monodromy around the given cusp. It is perhaps worth¬ 
while giving a more explicit proof in the special case at hand, since this makes the 
argument clearer and also shows how to find the full expansion, and not just the 
order of vanishing, of / and / at every cusp. We will just give the main formu¬ 
las, without complete details. At each cusp we choose a local parameter e = Sj. 
Then we can find the expansions of y{t) = f{z) and y{t) = f{z) near t = tj by 
looking at the explicit form of the degenerations of the differentials uj and w there, 
as explained at the end of the discussion in §6.2. We consider the cusp t = oo 
here and the other cusps even more briefly in iii) below. Near t = oo we make the 
substitutions {X,t) = i), with c = and A = 5 as in (39), 

and where e tends to 0. Then by direct computation we find 

P,{X,t) = {c/ef [T (T^ - A?) (T^ - A^)]^ + 0(£-4), 

with Ai = and A 2 = The fact that the leading coefficient of the 

right-hand side as a Laurent series in e is a square corresponds to the degeneration 
of the fiber over the cusp to a rational curve, and lets us compute the differential 
form zj = dXjyJP^{X, t) as 


w 


2 ^ 3/2 

c3/2 


1 

(T2 - A?) (T2 - Ai) 


-f ••• 


dT, 


where the omitted terms contain higher powers of e with coefficients that are ratio¬ 
nal functions of T having poles only at ±Ai, ±A 2 that can easily be found explicitly 
with a suitable mathematical software program. The factor gives the vanishing 
order we claimed, and the rest of the expansion gives us the complete expansion 
of f{z) near z = 0. Specifically, the homology of \ {±Ai,±A 2 ,oo} is spanned 
by the four small loops 7 ^ around the four poles ±Ai and ±A 2 , and the integral of 
the above form around each such loop is given simply by the residue of the form at 
that pole, so that we can easily calculate the periods around each 7 ^ to any order 
in e. When one does this calculation, one finds that these integrals are given, up 
to a constant, by 


uj{e) = ( 1,-1 


1 + V^ 
2 ’ 


l + yi7 
2 


> 72 , 73)74 


)(i + 


81-15vT7 „ I 4845-1155vT7 „2 
16 ^ 64 ^ 


The surprising observation that they are all proportional is explained by the fact the 
loops 7 i correspond to the core curves (in both directions) of parallel cylinders in the 
generating flat surface, here concretely the vertical cylinders in Figure 1 (a). More 
precisely, these curves stay parallel in a neighborhood of the cusp of the Teichmiiller 
curve by definition of these curves, and hence the periods remain proportional. 
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j 

t 

z 

Wj 


1 

0 

00 

a = i±#! 

%/T7-1 

2(3 + 7T7)(5-f^) " 

2 

1 

1 

8 + 2717 

_ _ 5 ++T 7 

4 (3 + 717) (4 + 717) 4 

3 

A-i 

a/2 

-3 + +T 7 

2 

39+vT7 

211(4 + 717)1* (5+^] ^ 


Table 1. Cusps 


Another striking property of the above expansion, namely the integrality (up to 
powers of two) of its coefficients as a power series in the local parameter e = Xjt 
at the cusp t = oo, is obvious both from the proof in [5] and from the one that we 
will give in Section 8 . 

iii) We now also indicate briefly how to find the Fourier expansions of / and / 
at a cusp tj other than t = 0. To define them, we must first choose a matrix 
Mj mapping the point Zj with t{zj) = tj to oo. For definiteness’s sake we choose 
Ml =Id and Mj = (i ) for j ^1. The width wj of the cusp is defined as the 
smallest positive number w with M~^ (J ^)Mj € F, and we define qj = 
and Qj = AjQj, where Aj is chosen so that Qj = e + O(e^) as e —>■ 0. Notice 
that both the width Wj and the value of the scaling constant Aj depend on our 
choices of Sj and Mj; with the choices given above, they are given as in Table 1 
for representatives of F-equivalence classes of cusps. At each cusp we expand the 
integral as a power series in the local parameter, observing that each coefficient is 
the integral of a rational function on a punctured Riemann sphere, and proceed 
just as we did above for t ^ oo. We omit the calculations and give only the results 
(for /; those for / can be obtained in the same way): 

/I1M2 = i(3 + 717)5/2(4+717)1/2 (1 - iQa + 3807+915 vT7 q2 + ...) ^ 

/|l^3 = ^(3 +717)11(4+ 7T7)-4 (1 + -255+1959+T7 q^ + ^ 

This concludes our discussion of the modular forms and twisted modular forms 
on the group 11. Proposition 7.1 also allows us to describe the twisted modular forms 
on the Teichmiiller curve itself, i.e. for the group F. Since this group contains —I, 
there are no twisted modular forms of weight {k,£) for k + £ odd. Theorem 3.2 and 
the arguments in the preceding proof immediately imply the following statement. 

Proposition 7.2. The ring of weight modular forms of even total weight k + £ 
for F has the graded dimensions 

‘\h A- £ 

dimM(fc,,)(F) = 1 + . 

It consists of the invariants of (11) as given in (54) under the involution 

t^t-\ f^ft^/^ 
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with s as in (35). (The numerical factors in the definition of rj have been included 
for later convenience.) In particular, when we embed the Teichmiiller curve into 
the Hilbert modular surface, then the restriction of a Hilbert modular form of 
weight {k, k) is a homogeneous polynomial of degree fc in ^ and rj, or equivalently 
equals times a polynomial in s of degree at most k. We will use this in the next 
section to find an explicit description of this embedding. 


8. The Hilbert modular embedding of Wl-j 

We continue to study the example of the special Teichmiiller curve W = IH/y, 
using the information given in the previous section to give a complete description 
of the embedding of W into the Hilbert modular surface Xn. 


8.1. Modular embedding via Eisenstein series. We can use our explicit knowl¬ 
edge of the twisting map (/? : IH —>■ El” and of the twisted modular forms on H and 
on r to embed H/H and H/T into Xiy by expressing the restrictions of Hilbert 
modular forms as polynomials in /, /, and t (or simply in ^ and r] if we restrict our 
attention to forms of parallel weight). We begin by recalling the main points of the 
theory of Hilbert modular forms, using SL(2, 0) rather than SL(0'' © 0) since it is 
more familiar, and then later transform our results back to SL(0'' © 0) for D = 17, 
using the isomorphism of the two groups in this case. A textbook reference for the 
notions and claims in this section in e.g. [39]. 

A Hilbert modular form of weight {k,i) on the full modular group SL(2, Od) is a 
holomorphic function F : —>■ C satisfying the transformation law F{'yzi, 7 “^ 2 ^ 2 ) = 
{czi + df{d^Z 2 + d'^YF(zi,Z 2 ) for 5 = (“^) G SL(2,0d). li k = £ (“par¬ 
allel weight”), we call the weight simply k. We denote the space of such forms 
by Mfc_£(SL(2, 0 _d)), or simply Mfe(SL(2, O^i)) ii k = £, and more generally write 
Alfc,f(r,x) for the corresponding space for forms with respect to a subgroup T of 
SL(2,0 d) and character y : SL(2,0 )d —>• C*. On Mfc(SL(2, 0 _d)) we have the 
involution induced by the symmetry l : ( 21 , 22 ) '—>■ iz 2 ,zi) of H^, so we can split 
this space into the direct sum of the subspaces M^(SL(2, On)) of symmetric and 
antisymmetric Hilbert modular forms. 

A Hilbert modular form F on SL(2, On) has a Fourier expansion of the form 

F(z) = F(zi,Z 2 ) = bo + ^ b^e(tr(iyz)) (60) 

I'GO'f, i^>0 


where Of, is the inverse discriminant and tr(uz) for z = (21,22) G means 
U 2 i + u'^ 22 . If we choose a Z-basis for Of, and write X and V for the corresponding 
exponential functions e(tr(uz)), then the right-hand side of (60) becomes a Laurent 
series in X and Y (or even a power series if the basis is chosen appropriately). In 
practice it is sometimes more convenient to choose only a Q-basis for Of,, in which 
case we work with power series with congruence conditions on the exponents of X 
and V. The simplest choice is 


V = Y(z) 


/Zl- Z2 

V 2VF 


) 




(61) 
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in which case {e(tr(i^z))} = {X^Y^ \ m = n (mod 2)} and (60) becomes 


F{z) 


E 

m—O 



G c[y,y-i][[x]]. 


(62) 


For Hilbert modular forms the same remarks as in §5.1 for Hilbert modular 
surfaces apply concerning the (less standard) Hilbert modular groups SL(0^ © 0) 
or functions on EIxEI“, the only difference in the latter case being that the condition 
V 0 in (60) must be replaced by v > 0 > 

The simplest Hilbert modular forms to construct are the Eisenstein series of 
weight {k, k) (A: = 2, 4, 6,...), with Fourier expansion given by^ 

£fc(z) = + E crf_i(i/v^e(tr(i^z)). (63) 

i/>o 


Here C,k{s) denotes the Dedekind zeta function oi K = Q{'/D) and cr^_i (v'/D) for 
i' G 0^ with 0 is given by 

b\'uy/D d\u'/D 


where the first sum runs over integral ideals b of it" dividing the integral ideal i/'/D 
and the second sum (whose equality with the first is proved in [44], Lemma on 
p. 66) runs over positive integers d such that d~^v G 0^, and where (Tk-i(rn) for 
m G N has its usual meaning as the sum of the (fc — l)st powers of the (positive) 
divisors of m. In particular, for D = 17 the first three Eisenstein series £fc = 
have Fourier expansions beginning 

£2(z) = ^ + (31"^ + 7Y + 7Y-^ + X + (r® + 2iy® + i4r^ 

+ 45y^ +18 + 45r"^ + i4r-4 + 2ir-® + y-®) + ..., 

41 

£4(2) = + (9y® + 73y + 73y-i + 9y-®)x + (y® + 657y® 

+ 2198y'‘ + 5265y2 + 4914 + 5265y"2 + • • • + y®) + • • • , 

£6(z) = + (33y® + 1057y + 1057y-^ + 33y-®) X + (y® + 3488iy® 

+ 371294y‘ + 1116225y^ + 1419858 + • • • + y®) + • • • , 

with X and Y as in (61). As a check, we can verify that if we set X = q and 
y = 1, corresponding to the natural embedding z i-A {z,z) of H/SL(2,Z) into 
H^/SL(2, 0), then these Fourier expansions agree to the accuracy computed (several 
hundred terms) with those of the classical SL(2,Z) modular forms ^E 4 , 
and ^Ef + respectively. 

We now compute the restrictions of these Eisenstein series to the Teichmiiller 
curve W = H/F (or rather, of its double covering H/H) that we studied in §6. As 
explained there, the algorithm used produced F as a subgroup of SL(2, 0), rather 


®For general discriminants there are several Hilbert-Eisenstein series for each value of k, with 
sum £(). If the class number of D is one, as is the case for D = 17, there is only one. 
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than SL(0^ © 0), so that the twisting function (p{z) maps H to H (cf. eq. (52)). 
Hence we must use the embedding of W into ]HI^/SL(2, 0) by 

2 : H> {zi,Z 2 ) = {sz, e^^ipiz)) ( 2 :GIH), (64) 


where e = 4 + y/Vf is the fundamental unit of Q(vT7). Using the expansions of Q 


and Q as power series in t that were given in §7.1, we find that the monomial 
^myn _ e(tr(i^z)) for = ^ + G 0 ^ has the t-expansion 


.-= 1 +^ /5-yT7^ 


/ ! - X 21m + 5n ( ^ — \/T7\ 

= (-2(3 + 717)) ^ 


2 lTn + 5Ti 

t 2 


(65) 


, , 16597m + 3985n - (3827m + 919n)7T7 

+-ik-+ 


(More precisely, this is true under the assumption that the constants A and A 

relating q = e{z/a) and q — e((p(z)/a'^) to Q = tH-and Q = t-\ -indeed have 

the values given in (49) and (51); we will return to this point below.) Inserting the 
expansion (65) into the Fourier development of £fc(z) or any other Hilbert modular 
form, we can compute its restriction to IF as a power series in t. On the other 
hand, as we saw above, the restriction of any F G Mk{SL{2, 0)) to IF is a linear 
combination of monomials with i + j = k, where ^ and iq are the functions 
defined in (59), whose expansions in t are known, so we can find the coefficients 
of this linear combination by linear algebra. The result of the computation for the 
first three Eisenstein series is 


12 £9 


w 


^2 11 + 717 , , 2 

= ?- ^+ V , 

1855 + 365717 


120£4|^ = 4l7 - + 

, 151 + 35717,3 , 4 

+ ---^7 + 417 


18245 + 3979717 ,2 2 

-5-? V 


252 £« 


TV 


= 57917 - 


6 867831 + 173541717 ,5 11350461 + 2429643717 ,4 2 


77 + 


16 


■77 


1883335 + 652933717 ,3 3 69270195+ 16881483717,2 4 

S ^ F A ^ V 


16 

1983525 + 452397717, 5 e 

--e7 + 57917 . 


64 


(It was to simplify the coefficients in these polynomials that we introduced the 
factor c in (59). The coefficient of 7 here is just the constant term of £fc, and 
in particular rational, and the coefficient of rj^ has the same value because the 
matrix (J }) belongs to SL(2, 0), though not to F, so that the constant terms at 
the two cusps s(0) = 0 and s(I) = 00 of the restriction of any Hilbert modular form 
to IF are the same up to scaling.) Now by elimination we find a polynomial 


P(£ 2 , £ 4 , £ 0 ) = (3465994203567- 840620808790 717) £5 

+ ••• + (7395484320944244318526129490625 
- 1711627845603248913114298550625 717) £7 
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(in which we have omitted seventeen equally gigantic intermediate terms) whose 
restriction to W vanishes. We have thus obtained an explicit algebraic equation 
cutting out the Teichmiiller curve W on the Hilbert modular surface X 17 , but it is 
too big in the sense that its vanishing locus is reducible and W is only one of its 
components. Indeed, we know from the results of Bainbridge [2] that there must 
be a symmetric Hilbert modular form of weight 12 which vanishes precisely on the 
Teichmiiller curve and its image under the involution i : (zi, Z 2 ) 1 — (z 2 , Zi), whereas 
the above equation has weight 24. The reason for this is twofold: 

• the Eisenstein series generate only a subring of the full ring of symmetric Hilbert 
modular forms, and there is no reason that the minimal defining equation of W 
should belong to this subring; and 

• we did not even use all the Eisenstein series, but only £ 2 , £4 and £ 5 . 

The second point can be dealt with by expressing the restriction of each £k to W 
as a polynomial in ^ and ry and looking for the first weight in which some lin¬ 
ear combination of monomials in these restricted Eisenstein series vanishes. This 
weight, however, still turns out to be 14 rather than 12, and the answer is not even 
unique: there is a 2-dimensional space of linear combinations of £ 2 ,..., £14 that 
vanish on W, all having huge coefficients so that we do not reproduce them here. 
(As a side remark, it is actually surprising that there should be even one relation 
in such a low weight, let alone two, since there are 15 monomials of weight 14 in 
r 2 ,..., ri 4 and also 15 monomials of weight 14 in ^ and 77 , so that one would not 
expect the former to lie in a non-trivial subspace, let alone a subspace of codimen¬ 
sion 2. The first weight in which there has to be a relation for dimensional reasons 
is 16. Similarly, it is surprising that we found a relation among £ 2 , £4 and £5 in as 
low a weight as 24, since a priori the first weight in which the number of monomials 
in these three forms is larger than the number of monomials in ^ and ry is 38. This 
suggests that the restrictions of the Eisenstein series to W have some non-generic 
property, but we do not know what it is.) To address the first point, we need to 
have a full set of generators of the ring of Hilbert modular forms. Finding such a 
set of generators in general is a comparatively difficult (though always algorithmi¬ 
cally solvable) problem, but in the case D = 17 the result has been obtained by 
Hermann [16]. We will describe his results in the next subsection and use them 
to determine the symmetric Hilbert modular form of minimal weight 12 vanishing 
on W. 

Another pertinent remark is that, although we have so far only used Hilbert mod¬ 
ular forms of “parallel” weight (fc, fc), whose restriction to IT is a polynomial in ^ 
and ry, one can equally well consider Hilbert modular forms of mixed weight (fc, £) 
with k £, in which case the restrictions become polynomials in /, / and For 
example, the Rankin-Cohen bracket 2 E 2 {z)E 4 {z) — E 2 (z)i? 4 (z), where the prime 
denotes differentiation with respect to the first variable Zi, is a Hilbert modular 
form of weight (8,6), and its restriction to W could be computed explicitly as 
f(z)^f{z)^ times a Laurent polynomial in t(z). Such mixed weight forms will play 
a role in Part HI of this paper, e.g. in Section 9, where we will use a different con¬ 
struction to find explicitly a non-symmetric Hilbert modular form of non-parallel 
weight (3, 9) which vanishes precisely on W, again in accordance with the general 
results of Bainbridge. 
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8.2. Hilbert modular forms for the discriminant 17. We continue to work 
with the Hilbert modular group F_d = SL(2, 0) for the case D = 17, i.e., for 0 = 
Z[a] with Of = (1 + We will describe the structure of the ring of symmetric 

Hilbert modular forms of parallel even weight, following Hermann [16], who obtains 
these modular forms by restriction of Siegel modular forms of genus 2. Later we 
will look also at non-symmetric Hilbert modular forms and Hilbert modular forms 
of odd or non-parallel weight on Fiy. 

Hermann begins with the sixteen genus 2 Siegel theta series 

Qm,m'iZ,v) = ^ e(^^xZx* + x{v+ . ( 66 ) 

Here m, m' G {0, and the independent variables Z and v are in the Siegel 
half-space H2 and in C^, respectively. Ten of these (those for which the theta 
characteristic {m,m') is even, i.e. 4m ■ m' = 0 (mod 2)) are even functions of v 
and hence give Siegel modular forms of weight ^ after restricting to w = 0. (The 
other six are odd and hence give 0 on restriction, but their derivatives with respect 
to V give non-trivial restrictions that will play a crucial rule in the constructions 
of Part HI of this paper.) Using a modular embedding from to IH2 like the one 
described in the previous sub-section, we get ten Hilbert theta series, all of weight ^ 
with respect to a suitable subgroup of T^i. 

It is convenient to re-index the sixteen theta characteristics in a way that makes 
the action of Td more transparent. This part works for any D = 1 (8), i.e., for D 
such that the prime 2 splits as 7r27rf vn K = Q{'/D) for some prime ideal tt2 ^ tt ^ 
(in our case, the principal ideal generated by 1 -I- a). Dehne sets S and S by 

S' = {0,1, 00}, S = SUjX} 

where X is a symbol, and let F £> act on S x S by fixing X and identifying S x S 
with P^(0/712) X (0/(7(712)). We match the usual indexing by characteristic with 
these symbols by Table 2, in which the even characteristics correspond to the pairs 


m\m' 

X 

0 

1 

00 

X 

1111 

1011 

1010 

1110 

0 

0111 

0011 

0010 

0110 

1 

0101 

0001 

0000 

0100 

00 

1101 

1001 

1000 

1100 


Table 2. Reindexing of theta characteristics 


(a, 6) € S^ in which a and b are either both equal to or both different from X, 
and the odd characteristics to the pairs (a, b) in which exactly one of a and b is 
equal to X. This gives us by restriction 10 Hilbert modular forms Ox,x and 9a,b 
(a, 5 G S) of weight 1/2. 

Set Q = 9\ X and for each permutation tt of the set S set ?7,r = ± OseS •"•(«)) 
where the sign is -1-1 if tt is the identity and —1 otherwise. Up to powers of f, the 
Hilbert modular group preserves 0 and permutes the tIt^, preserving the parity of tt, 
so that if we set 

? 7 ± = ^ ± ^ 77 ^ 

TT odd 


TT even 
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then 0, rj-\. and rj- are Hilbert modular forms, with multiplier systems, for the full 
Hilbert modular group. More precisely, we have 

0 G Mi(rD,vo), ?7± G M3(rD,uo), r]+ri_ G , (67) 

where 

Vo'-To^m, J f) H> for a: G 0 (68) 

is a character of order 4. The form 9x,x is antisymmetric with respect to the involu¬ 
tion L and hence vanishes on the diagonal ]HI/SL(2,Z) C H^/SL(2, Od)- Moreover, 
in the case D = 17 this is its full vanishing locus, so that any Hilbert modular 
form vanishing on the diagonal is divisible by 9x,x and any symmetric Hilbert 
modular form vanishing on the diagonal is divisible by 0. For example, since the 
restrictions of both and rf^ are proportional to (where we use i?4, Eq and 
A = (i?4 — i?|)/1728 to denote the standard generators of M*(SL(2,Z))), some 
linear combination of them, which turns out to be — 4ry^, vanishes on the diag¬ 
onal and hence is divisible by 0. This gives us the following five symmetric Hilbert 
modular forms of even weight and trivial character: 

G 2 = , G4=ry+?y_0, = Gq = , He = r]^r]+, 

where the index of each form indicates its weight. 

Theorem 8.1 (Hermann, [16]). The ring M^iTn) = 0j,>q M^(ri7) of symmet¬ 
ric Hilbert modular forms of even weight for D = 17 is generated by the five Hilbert 
modular forms G 2 , G4, Hi, Ge and He, with the relations 

GiGe = HiHe, Gl = ^Ge{He + G2 Gi), Gj = H^ (AGl - G2 Ge). 

In particular, Mg(,(ri7) is a free module of rank 4 over the algebra G.[G 2 ,Hi,He\, 
with basis {1, G4, Ge, G4}. 

Sketch of proof (following Hermann). The relations among Hermann’s five forms, 
like any relations among modular forms, can be verified algorithmically by looking 
at a finite part of the Fourier expansions of the functions involved, so we only have 
to show that these forms generate the whole ring. Let F = Eg G Mfc(Fi7) be a 
symmetric Hilbert modular form of even weight k with trivial character. Then the 
restriction of F to the diagonal has weight 2k divisible by 4, and since the ring of 
modular forms on SL(2,Z) of weight divisible by 4 is generated by the forms E 4 
and A, which are multiples of the restrictions to the diagonal of G2 and He, there 
is a weighted homogeneous polynomial Pg(G 2 ,He) in G2 and He whose restriction 
to the diagonal coincides with that of Eg. By what we said above, we then have 
1^0 = Po{G 2 ,He) -\- QFi for some Hilbert modular form F’l G The 

restriction of Fi to the diagonal has the character of ^fK and weight congruent 
to 2 modulo 4, so by the same argument as before coincides with the restriction 
of ri+r]-Pi{G 2 , He) for some weighted homogeneous polynomial Pi{G 2 , He). This 
implies in turn Fi = ri+r]-Pi{G 2 , He)-\-&F 2 for some F 2 G Mk- 2 {Ti 7 , Vg). A similar 
argument shows that F 2 has the same restriction to the diagonal as 'q\'qfP 2 {G 2 , He) 
for yet a third polynomial P 2 , so F 2 = ry^ry^P2(G2, iJe) + ©Ag for some Eg G 
Mfc_3(Fi7, Vo), and a final iteration gives a fourth polynomial Pg such that Eg = 
iy^T3(G2, He) -b 0F4 for some Hilbert modular form F4 of weight k — A, now again 
with trivial character. Combining these successive identities we have written F 
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as Pq + G 4P1 + G4P2 + GqPs + HiFi where each Pi belongs to C[G2,P6] and 
P4 £ A^^4(ri7), and now iterating the whole argument we see that P is a linear 
combination of 1, G4, G4 and Gq with coefficients in C[G2, P4, Hq] as claimed. □ 


Example. The Fourier expansions of rj- and 0 begin 

TJ+ = -4 {{Y + Y-^) + {13Y"^ - 19Y^ - 19Y-^ + X + ■■■) , 

r]- = 16 X^!^ (1 - {Y^ YiY^ - Y -Y-^ + + F"®) X + ■ ■ ■) , 

0 = 4X^/2 ((f2 _ 2 + y-2) _ 2{Y^ -Y- Y-^ + Y-^) X + ■■■) . 

(As a check, if we set F = 1 and X = q then these reduce to —Sry®, IGry®, and 0.) 
Comparing with the expansions of the first three Eisenstein series £fc given above, 
we find that these forms are given in terms of Hermann’s generators of by 


-192 £2 = G2, 
14336 £6 = -^^G^ 


640£4 = ^*^2 - 39G4 - 57P4 


8571G4G2 


11463 P4G2 - ^^^Ge - 6285 Pe- 


72 z ■ , ^ ^ 4 

Corollary 8.2. The function field field of the symmetric Hilbert modular surface 
with D = 17 is the rational function field is generated by the two functions 

03 


U = 


E± 

Gi 


1-^+ 


V = 


He 

He — G2G4 


j^2 

4??+ 


(69) 


Proof. The relations among Hermann’s generators imply 


(G 4 

Hi 

Ge 

He\ 

/ UV 

U^V 

{72]/2 jjy2 

VGi’ 

Gi ’ 

Gi’ 

Gp - 

U(F- 1)2 ’ 

4(F- 1)2 ’ 

' 4(F- 1)3 ’ 4(F- l)3j 


so the corollary follows immediately from the theorem. □ 


8.3. The equation of the Teichmiiller curve. The corollary just given tells 
us that the Hilbert modular surface for P = 17 is rational, with coordinates U 
and V. In particular, the image of the Teichmiiller curves on this surface 
must be given by polynomial equations in these coordinates. In this subsection we 
will give these equations, which turn out to be several orders of magnitude simpler 
than the previously obtained equation P{£, 2 , £- 4 , ^-e) = 0. We will also describe 
better systems of Fourier coordinates and will resolve two questions that we raised 
earlier by showing that the values of A and A given in (49) and (51) on the basis of 
numerical computations are indeed correct and by giving a purely modular proof 
of the integrality (away from the prime 2) of the Taylor expansions of y and y as 
power series in t. 


Theorem 8.3. On the (rational) symmetric Hilbert modular surface with coordi¬ 
nates U and V, the Teichmiiller curves and Wf-j are given by the equations 


wh 


V + 


5 + ^17 






1 - VT? 


= 0 


(70) 


W%: 


V 




= 0 . 


(71) 


and 
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Proof. Since we gave the expressions for the Hilbert-Eisenstein series £2, £4 and 
£5 in the Hermann generators in the last subsection, we could derive (70) from 
the results of §8.1 giving the restrictions to the curve of these Eisenstein 
series. However, it is much simpler to work directly with Hermann’s generators, 
obtaining their Eourier expansions from those of ri± and 0 as given above and then 
using (65) to obtain the f-expansions of their restrictions to W. The results of the 
computations are given in the following table, in which we have used the results 
from the end of §7.3 to write the expansion of the restriction of each of G 4 , H 4 , 
Gq and Hq to W (that of G 2 = —192£2 was already given above) as a power of f 
times a polynomial in s: 


F 

eG4/2^7r« 

£F4/2^7r§ 



^-''F|IE 

s(s — I)(s — Kl) 

s(s - I) 


s2(s — I)^(s — Kl) 


Here = 712 ^ From the definitions of the Hilbert modular functions 

U and V it then follows that their restrictions to W are given by 


U 


w 




-I + VT 7 s(s-l) 

8 (s-«;i)2 ’ 


(72) 


and equation (70) follows immediately. Equation (71) is proved in a similar way 
using the Galois conjugate differential equation, as discussed in the remark at the 
end of §5.5; the resulting expansions are the Galois conjugates of those for and 
since the Hermann generators have rational Fourier coefficients the Hnal equation is 
necessarily also the Galois conjugate of that of Wf^- In fact, this Galois conjugation 
property holds for all D, as was already recalled in Theorem 5.4 (iv) of §5.4. □ 


We end this section by discussing four points related to the equations given in 
Theorem 8.3. 

1. Equation (70) describes a Hilbert modular function that vanishes precisely 
on the curve Wf’j. We can also ask for the holomorphic Hilbert modular form of 
smallest weight with the same property. If we multiply the left-hand side of (70) 
through by then by equation (69) the result is 






VT7 


Q^rj+r]_ + 


1 - y/rj 


2 J2 




(73) 


and by Theorem 8.3 this vanishes precisely on According to (67), is a 

holomorphic Hilbert modular form of weight 6 on the full Hilbert modular group, 
but with quadratic character Vq, where vq is defined as in (68). If we further 
multiply it by 0^, then we get a Hilbert modular form of weight 8 on the full 
Hilbert modular group and with trivial character, given in terms of the Hermann 
generators by 


0 %\ 


= 4G2G6 


VT7 




Gl 


but this form now vanishes not only on but also (doubly) on the diagonally 
embedded modular curve EI/SL(2,Z) C ]Hl2/SL(2, 0). On the other hand, if we 
multiply by its Galois conjugate F°y = (F^y)'’’, then the product F17 vanishes 
precisely on the full Teichmiiller locus Wiy = lEfy U and this is now a holo¬ 
morphic Hilbert modular form on the full modular group SL(2, 0) and with trivial 
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character, given in terms of the basis of Mi2(SL(2, 0)) from Theorem 8.1 by 
Fn = (explicit polynomial in G2, G4, Fl^^ Gq, F[q with rational coefficients). 


We do not write out the polynomial, since it is a bit complicated, but observe that 
it involves only 11 of the 16 generators of Mi2(SL(2, 0)). The fact that here there 
is a single Hilbert modular form of weight 12 whose vanishing locus is precisely the 
union of the Teichmiiller curves on Xjy is a special case of the theorem of Bainbridge, 
already mentioned in §8.1, stating that such a form Fjy exists for every D. We will 
give a different proof of this in Part 111 by constructing Fd e Mi2(SL(0''©0)) in 
general as a product of twelve derivatives of theta series of weight (5, |) or (§,5). 

2. The next point concerns the choice of coordinates for our Fourier expansions. 
We replace the previously used coordinates X and Y from (61) by the new Fourier 
variables 

Xi = X-^Y^^, Yi = 

This has several advantages. First of all they form a Z-basis of the group of Fourier 
monomials e(tr(a;z)), whereas X and Y generated a subgroup of index 2. Secondly, 
symmetric Hilbert modular forms of even weight are symmetric in Xi and Ti, as 
one sees using the action of e . For instance the (Xi, yi)-expansions of the first two 
Hermann generators begin 

-ife <^2 = -^ + YiXi + {9Y^ + STi^) X^ + {3Y^ + lOlf + 15^/) Xf + ■ ■ ■ , 
^ G4 = - TiXi + {-9Y^ + xf + {Y^ - UY^^ + F^) xf + ■ ■ ■ , 


in which the coefficients of Y^Xf and Y^Xf are the same. Thirdly, and most im¬ 
portant, both are holomorphic near the cusp of W and hence have power series 
expansions in t, with valuations 1 and 0 there rather than 21/2 and 5/2 as for X 
and F. This has to do with the Hirzebruch resolution of the cusp singularities, 
according the which different Z-bases of the group just mentioned are good coordi¬ 
nates in different parts of the resolution cycle. (We will discuss this in much more 
detail in Section 11.) Here we have to choose the coordinates that are adapted to the 
point of the cusp resolution through which W passes. Explicitly, these expansions 
are 


Xi = X-^Y^^ 
Fi = X^Y-^'^ 


11+3vT 7 jL I 403-229 vT7 ^2 , 

64 '' 2048 '' 

21-5vT7 _ 895-217vT7 ^ , 

2 8 I -I- • • • . 


(74) 


(For comparison, the leading terms of X and F are .^ 524445 22^0267v^ ^2i/2 

respectively.) It is then very easy to restrict a Hilbert modular 
form F given in its (Xi,Fi)-expansion to IF: to get the expansion up to order 
it suffices to expand F up to Xf in C[Fi][[Xi]]. 

3. The third point concerns the correctness of the values of A and A in (49) 
and (51). We have been assuming throughout (for instance in (74)) that these 
guesses were correct, and now finally prove it. We write 


Xi = — Cl t exp^eciA— + (eaX)'^ , 

Fi = 21-5VT7 ^ exp(ga^ — + [ea^iV , 

V y y J 


(75) 
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where A = and ^ (the factor £ in the exponent takes care of 

the identification of H x H and HI x HI”, the factor a is the cusp width, and the 
factors A and /i come from the passage from {X, Y) to (Xi, Yi)) and where ci and C2 
are numerical factors that are both equal to one if (and only if) the formulas for A 
and A in (49) and (51) are correct. (One could—and we originally did—also do the 
whole calculation without including the prefactors — and in (75), 

but then the numbers in the calculation would be even worse.) We substitute these 
expressions into G 2 , divide by with ^ as in (59), and write the quotient as a 
power series with s = as usual. By Proposition 7.2, we know that 

this power has to reduce to a quadratic polynomial if ci and C2 have the correct 
values. If we instead treat ci and C2 as unknowns, then the coefficients of this power 
series are polynomials in ci and C2 (with huge coefficients), and we have to show 
that the infinite system of polynomial equations Ci{ci, C2) = 0 for all i > 2 has the 
unique solution ci = C2 = 1. By computer calculation we find that the g.c.d. of the 
resultants ReSci (Ca, (74) and ReSci ((74, Ca) is equal to —2“^^^C2(c2 — 1), and since 
C2 cannot be 0, it must be 1. Then substituting C2 = 1 into the g.c.d. of C 3 and C 4 
gives Cl = 1. 

4. The final point concerns the integrality (away from the prime 2) of our 
Fourier expansions. The coefficients of the expansion of any Hilbert modular form 
in Xi and Yi with rational coefficients always has bounded denominators, like in 
the examples for G 2 and G 4 above. The same is true also for Hilbert modular 
functions, e.g. 

U + 1 = (Yi - YD Xi + (-Yi - 15Y2 + - y/) Xj + ■ ■ ■ , 

i Y = Yi Xi - (22Y]2 + 2YD X2 + {-2Y^ + 289Yf + 12Y(‘ + Y^ X)^ + • • • 

for the generators of the field of symmetric Hilbert modular functions given in 
Corollary 8.2 above. We introduce new generators of this function field, namely 
Ui = l— and Yi = \V. (Other choices would be equally good.) 

Proposition 8.4. In the new set of generators the equality 

Z[[Xi,Yi]]^y“ = Z[[C/i,Yi]] 

of power series rings holds. 

Proof We have Z[[Xi, Yi]]"y“ = Z[[S, P]], where S' = Xi + Yi and P = XiYi. We 
have already seen that U and Y, and hence also Ui and Yi, are symmetric in Xi 
and Yi. We can thus express Ui and Yi as a power series in S and P. Concretely, 
these expansions start 

Ui = S + (-7 + 3S + 3S2) P + (13 - 65S + + 6 S^)P‘^ + • • • , 

Yi = P + (-22 - 2S) P2 + (289 + 12S + S^) P^ + • • • . 

They have integral expansions, so Z[[Pi,Yi]] C Z[[S,P]]. Conversely, since the 
expansions begin S + 0(P) and P + O(P^), we can recursively compute S and P 
as power series in Pi and Yi, and these again have integral coefficients: 

5 = Pi + (7 + 3Pi + 3Pi) Yi + (120 - 2OP1 - 82U^ + 6Pf) Yi^ + • • • , 

P = Yi + (22 + 2Pi) Yi^ + (693 + 158Pi + Pi^) Yi^ + • • • . 

It follows that Z[[Pi, Yi]] A Z[[S', P]], as desired. □ 
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Corollary 8.5. The restriction to Wn of any symmetric Hilbert modular form of 
even weight with integral Fourier coefficients belongs to where R = 0i7[5]. 


Proof. Since kq G , eq. (35) implies that = -R[[s]]- Since ki G R^ , eq. (72) 
implies that the restrictions of Ui and Wi to W belong to this ring. Since any 
Hilbert modular form has a Fourier expansion with exponents in a cone strictly 
contained in the positive quadrant (explicitly, if F = ^ then ar,s = 0 

unless r < s < r), it contains only finitely many monomials contribut¬ 

ing to any fixed power of t in the t-expansion of its restriction to W. (Explicitly, 
XfYf -\-XfYf is divisible by ^ P\w~ corollary follows. □ 

Proposition 8.6. The power series y{f) and y{t) have On-integral expansions up 
to denominator 2, i.e. y{t),y{t) G R[t], where R = 


Proof. The differential operators Di = .9 can be written as 


Di = XXi 


dXi 


fYi 


d 

m’ 


2-Ki dzi 

D2 = A" 


27rz dz2 


where A = ^ _ -2i+5vT7 ^ g^^ hence map Vi]] to itself. On the 

other hand, they send Hilbert modular functions to meromorphic Hilbert modular 
forms of weight (2,0) and (0,2) respectively. By Proposition 7.2, the quotients 
of the restrictions to Wn of these derivatives by (resp. ip) must be rational 
functions of t. We have to make the right choices of these functions in order not to 
introduce unwanted denominators. Since l/U and VjU'^ restrict to polynomials in s 
(cf. eq. (72)), we choose these as the Hilbert modular functions to be differentiated, 
finding 


V 


2 



Wn 





Wn 


where c = 29+1^^^ g _ 7+vT7 g^^^ jg polynomial 

(1 _ ^1+^t) (1 - 647-1|3vT7 ^ ^ ^2) ^ 

Since c and c belong to R and the polynomials in the denominators are in 1 -\-tR[t], 
it follows that and and hence also y and y, are in □ 


P{t) = t{l + t){l- 


Although we have been working with D = 17 all the time and using concrete 
generators of the field of Hilbert modular functions, it is clear from the proof 
that the basic principle—the use of integral coefficients for Hilbert modular forms 
and the base change in two variables—can be applied for any D, giving the Od- 
integrality (up to finitely many primes in the denominator) of solutions y(t) and 
y(t) of the corresponding Picard-Fuchs equations for any D. 
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Part III: Modular embeddings via derivatives of theta functions 

In Part II we deduced the equation of the Teichmiiller curve in the Hilbert 
modular surface for D = 17 and of a Hilbert modular form cutting out the Teich- 
miiller curve starting from the differential operators L and L, which in turn were 
deduced from the explicit algebraic model of the family as given in (38). In Part HI 
we will show that there is a general construction of a Hilbert modular form of mixed 
weight that cuts out the Teichmiiller curve. The construction, given in Section 9, 
uses derivatives of theta functions. The short proof depends on the description of 
Teichmiiller curves using eigenforms for real multiplication with a double zero (see 
Theorem 5.2). We also verify that in the case D = 17 we get the same equation for 
the Teichmiiller curve as the one already obtained in Section 8 . 

Next, in Sections 10 and 12, we develop the theory of Teichmiiller curves in 
genus two “from scratch” starting from the new definition as vanishing loci of theta 
derivatives. In particular we give new proofs from this point of view of the cusp 
classification and of the facts that these curves are Kobayashi geodesics and are 
disjoint from the reducible locus and hence are Teichmiiller curves. We do not 
know how to reprove the irreducibility from the viewpoint of theta functions. 

Along the way, in Section 11 we show that Bainbridge’s compactification of 
Hilbert modular surfaces using the moduli space of curves is indeed a toroidal 
compactification. Recall that this property of Hirzebruch’s compactification was 
the model on which the notion of toroidal compactifications was developed. 


9. TEICHMULLER curves are given by THETA DERIVATIVES 

Bainbridge has shown in [2], Theorem 10.2, that the Teichmiiller curves Wd 
defined in Section 5.3 are given as the vanishing locus of a modular form (3,9) for 
all D. We determine this form explicitly. It turns out to be a product of derivatives 
of theta series restricted from the Siegel half space to Hilbert modular varieties. 

9.1. Theta functions and their restrictions to Hilbert modular varieties. 

We recall the definition of the classical theta-functions and properties of their 
derivatives. Although we are ultimately interested vci g = 2 only, we can keep 
g general without effort when setting up the definitions. 

For m,m' € ( 5 ^)® (considered as row vectors) we define the (Siegel) theta 
function 

r C9 X Hg ^ C 

: S {v,Z) !->• ^ e (^^xZx^ + x{v + m')^) . 

y x^'L3-\-m 

with characteristic The evaluation of a theta-function at u = 0 is called 

a theta constant. The theta-function (and the characteristic {m,m')) is called odd 
if Am{m')'^ is odd and even otherwise. Odd theta-constants vanish identically as 
functions of Z. Up to sign, depends only on m and m' modulo Z®. 

The theta constants 0(m,m')(O, Z) are modular forms of weight ^ for some sub¬ 
group (in fact r(4, 8), see e.g. [20]) of Sp(2g, Z). The partial derivatives with respect 
to any Vi are not modular, but if we restrict to u = 0 and consider the gradient (as 
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column vector) 

then we get a vector-valued modular form. That is, if Z) = 0, one 

calculates that for any M = (^ £) G r(4, 8) the gradient transforms as 

v(0(^.™o)(O>Af-Z) = Cs det(CZ + i?)i/2(C'Z + i4)v(0(„,™,))(O,Z), 

where Cs is an 8 -th root of unity depending on M. 

If iG is a totally real number field of degree g over Q with ring of integers 0, 
then just as in the special case g = 2 we can define a g-dimensional Hilbert 
modular variety Xk = ]HI®/SL(0^ © 0) and a Siegel modular embedding (4','i/)) 
of Xk into ]HIg/Sp(2g,Z), given by a matrix B G GL(g,IR) as in (25). Recall 
that this means that if : SL(C}^ © 0) —>■ Sp( 25 ,Z) is a homomorphism and that 
i/’(z) = H^diag(zi,..., Zg)B is a map that is equivariant with respect to 'I'. We then 
denote by 6»(„,™q(z) = 0(^_„/)(0, V>(z)) and V6»(™,„/)(z) = V0(™,„/)(O, i/>(z)) the 
restriction of the theta functions and their gradients to H®. We also write 
if we want to emphasize the dependence on the modular embedding. The modular¬ 
ity of the Siegel theta functions imply that the (Hilbert) theta constants 0(m,m')(z) 
are modular forms with a character of order 8 of weight (^,..., 2 ) for a subgroup 
of finite index of the Hilbert modular group SL(0^ © 0). The theta constants are 
non-zero if and only if (m, m') is even, while the theta gradients are modular if and 
only if (to, to') is odd. 

The modular transformation of the derivative of theta constants for (to, to') odd 
now reads 

V 6»(„,^/)(7 • z) = Cs det(cV'(z)+ dy/ 2 H"V(z, 7 )H V 6 l(^_^q(z) 

for 7 = (“ ^) in the subgroup d>“^(r(4, 8 )) C SL(0''©0), where e for e G RT denotes 
the diagonal matrix with entries (Jj{e) given by the different real embeddings Uj of 
K and J = is the diagonal matrix with entries crj(c) 2 j+(Tj(fi). Consequently, 

the vector-valued modular form B transforms with the automorphy factor 

Cs det(ci/>(z) + d)^/^ J(z, 7 ), which is also a diagonal matrix. We will calculate the 
root of unity Cs for K = Q{'/D) with D = 1 (mod 8 ) in detail below. 

To summarize, the z-th entry „j/)(z) of the column vector BX6^ is a 

Hilbert modular form of multi-weight and a character stem¬ 

ming from the 8 -th root of unity. 

We can also express the functions Di6(^m,m') as derivatives in certain eigendirec- 
tions. For this purpose replace the original coordinates v = {vi, ... ,Vg) G C® by 
the “eigendirection coordinates” u = Hv = (ui, . .., Ug). Then we may write 

d 

0(m,mq (^7 ^(^)) |u—0 ■ 

9.2. Wd is the vanishing locus of the theta-derivatives. With this prepara¬ 
tion we can now determine Bainbridge’s modular form for general D. 

Theorem 9.1. The function 

H0(Z) = l[ 

(m,m') odd 


( 76 ) 
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is a modular form of weight (3,9) (with character) for the full Hilbert modular group 
SL(0^ © 0_d)- Its vanishing locus is precisely the Teichmiiller curve Wd- 

We will give additional properties of the modular form T)9 if the prime (2) splits, 
i.e., if I? = 1 (mod 8). In that case, as already mentioned in §8.2, the discussion 
of the numbering of theta characteristics given there for D = 17 always holds. In 
particular, one has a quartic character vq given by (68). Here, HO is a product of 
two modular forms and its character is Vq. Indeed, using the shorthand notation 
introduced in §8.2 we calculate the action of generators of SL(0^ © 0) on the 
functions D29(^rn,m')- From Table 2 in §8.2, we deduce that the products of theta 
derivatives 

Dl = ■ D29ix ■ D^OooX and Dr = D29xo ■ D 2 OX 1 ■ D29xoo- (77) 

are modular forms for the full group SL(0'' © 0) of weight (|, |) with a character of 
order 8 according to Table 3 below, in which a = as usual. The function HO 

then equals DrDr and has character Vq of order 2. This computation also shows 


f{Zl,Z2) 

f{Zi + 1,Z2 + 1) 

f{zi + a,Z 2 + cr(a)) 

Zi '''^''/(1/Zi,l/Z2) 

Dl 

-Dl 

—CsDl 

-Dl 

Dr 

-Dr 

Cs^Dr 

-Dr 


Table 3. The action of SL(2, 0) on the D29l and D29r 


that if H = 1 (mod 8 ), then the Teichmiiller curve Wd has two components, the 
vanishing loci oi Dr and of Dr. This is our equivalent of McMullen’s spin invariant 
that we referred to after Theorem 5.2. 

Before proceeding to the proof of Theorem 9.1, we write out the Fourier expan¬ 
sions of the theta derivatives explicitly. Fix a basis (wi, 1 ^ 2 ) of 0 _d as we did in (25) 
for the construction of a Siegel modular embedding, say (wi, 012 ) = (1 ,70 = 

Then the first row the matrix B is the basis of 0 and the second row consists of 
the Galois conjugates. Given m G we let 

p{x) = (x + m) • ( 1 , 70 )^ 


for X G 1?. Then for (m,m') odd the theta derivative is given by 


21 ^ 


where qj = e{zj) as usual. 


Proof of Theorem 9.1. The modularity of T)9 for some level subgroup follows from 
the corresponding property of the Siegel theta functions, as shown in §9.1. It follows 
from the calculations above for = 1 (mod 8 ) that the factor group permutes the 
set of theta characteristics, preserving their parity. Moreover we read off from the 
table that D 6 * has character Ug in this case. In other cases D ^ 1 (mod 8 ), one 
checks similarly that the factor group still permutes theta characteristics, preserving 
their parity. In fact, classical calculations of Igusa show the full symplectic group 
Sp(4, Z) preserves the parity. Gonsequently, D9 is still a modular form, with some 
character, for the full Hilbert modular group. 
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For the second statement, let (7 be a curve of genus two with period matrix Z. It 
is cut out in its Jacobian as the vanishing locus of 0(o,o)(^', Z). See e.g. [12], Chap¬ 
ter VI and Chapter VII.l for standard properties of theta functions and Weierstrass 
points. The two-torsion points in Jac(C') are ZmF -|- (m')^ for m, m' G ■ The 

Weierstrass points of C are precisely the 6 points -I- (m')^ for {m,m') odd. 
Moreover, a holomorphic differential form to has a double zero on C if and only if 
uj vanishes (automatically doubly) at a Weierstrass point. 

Suppose the point [C] G M 2 lies on Wd- Then C has real multiplication, so 
Z = 'tp{z) for some fixed Siegel modular embedding ip. We identify the universal 
covering V of the Jacobian J{C) with the dual to 7J°(C', We claim that 

the characterizing condition on the existence of an eigenform with a double zero 
(see Theorem 5.2) is equivalent to the vanishing of the derivative of the Riemann 
theta-function in the second eigendirection U 2 at a Weierstrass point. In fact, the 
definition of the Abel-Jacobi map C -G J{C) implies that its projectivized tangent 
map is given by p >->■ (uJi(p) : cj 2 (p)) in the chosen eigenbasis of and 

so wi vanishes at p if and only if the t 62 -derivative of the defining equation of C 
in J(C) vanishes at p. Since we defined u = Bv this just means that 



^0(o,o).d(v,V'(z)) 


v=i/’(z)m'^ + (m') 


T 


= 0 


(79) 


for some even 

By the product rule, the derivative with respect to Vi of the defining equation 

0(o,o)(v -b -I-(m')^, Z)) = e(—TrfrnZrn"^ — 27riTO(v -b to')^) • 0(m,m')(v, Z) 

for Z = ip{z) has two summands. The factor given by differentiating the exponential 
term vanishes by definition for every pair of a point z G and v G Jac(C'z). Since 
e(-) is never zero, the vanishing of the derivative (79) is equivalent to z being in 
the vanishing locus of D6. 

To show that the modular form Mwd vanishes nowhere else there are several 
options. The hrst is to remark that the above argument can be inverted for Jaco- 
bians of smooth curves. So one has just to show that the vanishing locus of T)9 is 
disjoint from the reducible locus. We give two proofs of this fact that do not rely on 
any Teichmiiller theory in Section 12. Yet another way to conclude is to compare 
the weight of 'DO with the modular form that cuts out Wd in Bainbridge’s theorem 
from [2], Theorem 10.2. They are both of weight (3,9). □ 


The example D = 17 revisited. If we calculate the theta series and their 
derivatives for U = 17 as was done in Section 8.2, then we can verify that the prod¬ 
uct of Dii{zi,Z 2 ) with Dr{z 2 ,zi) is indeed proportional to the function given 
in (73), and similarly that the product of Dl{zi, Z 2 ) with Dl{z 2 ,zi) is proportional 
to the Galois conjugate function 


10. Cusps and multiminimizers 

Fix an invertible Oc-ideal a. Our aim in this section is to list the branches of 
the theta-derivative vanishing locus Wd through the cusp of the Hilbert modular 
surface Xd determined by the class of a. Our Ansatz is to describe the branch of 
Wd defined by a Hilbert modular form in (( 71 ,( 72 ) by (71 = q°‘ and (72 = q''^°‘\l +P), 



THETA DERIVATIVES 


57 


where q a suitable local parameter and P a power series with positive valuation 
in q. This will lead us to consider a minimization problem on quadratic forms. The 
solutions are given by so-called “multiminimizers.” 

The main result of this section will be the following characterization of cusps of 
Wd, derived from Fourier expansions only. We call an indefinite quadratic form 
Q = [a, b, c] standard!^ if a > 0 > c and a-|-6-|-c<0. To a quadratic form Q we 
associate the quadratic irrationality Ag = ■ The quadratic form is standard 

if and only if A = Ag satisfies 

A > 1 > 0 > A'^ . (80) 

A quadratic irrationality A satisfying (80) will also be called standard. 

Theorem 10.1. Suppose that D is a fundamental discriminant or, more generally, 
that a is an invertible Oo-ideal. Then there is a bijection between the cusps ofWo 
mapping to the cusp a of Xu and the set of pairs {Q,r) consisting of a standard 
quadratic form Q = [a, b, c] of discriminant D such that [(1, Ag)] = [a] together with 
a residue class r {a, c)Z. 

This will be proved in §10.2. We will also comment on the case of non-invertible 
ideals after a discussion of cusps of Hilbert modular surfaces in the same subsection. 

Given that Wu is a Teichmiiller curve, as we showed in Theorem 9.1 using 
the eigenform definition and will show again in the next section using the theta 
viewpoint only, this characterization reproves the list of cusps of [24]. (See also [2], 
Theorem 8.7(5).) For comparison, we will briefly sketch the approach based on flat 
surfaces in §10.3. 

The proof of Theorem 10.1 can be applied verbatim to prove the following result 
on the reducible locus Pu (see §5.2) from the theta function viewpoint. This result 
was also proven by Bainbridge using the flat surface viewpoint on cusps. 

Theorem 10.2. Let D and a he as above. Then there is a bijection between the 
cusps of Pu mapping to the cusp a of Xu and the set of pairs {Q,f) as in The¬ 
orem 10.1. In particular, for every given a the numbers of cusps of Pu and Wu 
mapping to the cusp a of Xu coincide. 

We will end the section by giving an algorithm in §10.4 to compute multimini¬ 
mizers using continued fractions. 

10.1. Multiminimizers. Let F{x,y) = Ax^ -\- Bxy Cy^ be a positive definite 
binary quadratic form with real coefficients. On each of the cosets of 2Z^ in 1? 
this form assumes its minimum a finite number of times. Of course on the trivial 
coset the minimum is 0 and is assumed exactly once, while on each of the other 
cosets the minimum is generically attained exactly twice, by some non-zero vector 
and its negative. We call the form [A, B, C] multiminimizing if on at least one of 
the three non-trivial cosets the minimum is attained more than twice. These forms 
are classified by the following proposition. 

Proposition 10.3. A positive definite binary quadratic form is multiminimizing if 
and only if it is diagonalizahle over Z. In this case, there is exactly one coset of 

■^There seems to be no standard terminology for these quadratic forms and quadratic irra¬ 
tionalities. They appear implicitly in [11] and [2]. 
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21 ? in Z2 on which the form has a multiple minimum; this minimum is assumed 
exactly twice (up to sign) and is the sum of the minima in the other two cosets. 


Proof. This is proved using reduction theory. Suppose that the form F is multi¬ 
minimizing. Since this property is obviously SL(2, Z)-invariant, we can assume that 
F is reduced, i.e. F = [A,B,C] with C > A > \B\. By Cauchy’s inequality we 
have \Bxy\ < A(x^ + ?/^)/2 , and hence 


F(x,y) > jx^ + (^C - y^ , 
for all (x,y) G In particular, 


x\ > 1, \y\ > 2 


F{x,y) 

> 

AC - 3A/2 

> 

x\ > 2, \y\ > 1 


F{x,y) 

> 

C + 3^/2 

> 

x\ > 3, \y\ > 1 


F{x,y) 

> 

C ± 4^ 

> 

x\ > 1, \y\ > 3 


F{x,y) 

> 

9C - AA 

> 


C, 

af\b\ + c, 

A+\B\ + C. 


These equations show that the smallest value of F on the coset “(odd, even)” is 
attained only at (±1,0) and equals A, that the smallest value of F on the coset 
“(even, odd)” is attained only at (0,±1) and equals C, and that the two smallest 
values of F on the coset “(odd, odd)” are attained only at (±1,±1) and are equal 
to A — B C and A-\- B + C. In particular, F is multi-minimizing if and only if B 
vanishes, in which case the only coset on which it attains its minimum more than 
once is “(odd, odd)” and the minimum there is attained exactly twice (up to sign) 
and is the sum of the unique minima in the other two cosets. □ 


Let a be a fractional 0£)-ideal in a real quadratic field K and f a non-zero coset 
of a in ia. (Thus there are three possibilities for f given a.) We denote by MM(a, f) 
the set of non-zero a £ K such that the quadratic form Fa{x) = tr(Q;a;^) is positive 
definite and assumes its minimum value on the coset f more than twice (up to sign), 
and call the elements of this set multiminimizers for f. We denote by MM (a) the set 
of all multiminimizers for a, i.e. the union of the sets MM(a, ^) for all three cosets f. 
Clearly, MM(a) is invariant under multiplication by positive rational numbers and 
by the squares of elements of the group C/d of units e of On- We set 

MM(a) = MM(a)/(Q+• C/|,) . (81) 

Later we will often use the representatives of a G MM(a) that are primitive in 
(a^)'^. They are unique up to multiplication by Uf,. 

Proposition 10.4. Let a be a fixed fractional On ideal of K. Then there is a 
bijection between MM(a) and the set of standard quadratie forms Q = [a, b, c] with 
b^ — Aac = D in the wide ideal class of a. 

Moreover, given a G MM(a), there is a unique basis (wi,W 2 ) sueh that uj 2 > 
wi > 0 of a with respect to which the form Fa is diagonal, and the coset f is then 
i(a;i ± W 2 ) ± a. 


Proof. Given Q 


[a, b, c] choose p G positive such that ^(1, Xq) = a and take 


—ac a b + '/D 

P^XqVD ~ 1?' 2/D 


(82) 
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This is positive definite if Q is standard. Since Q is simple the basis wi = 

UJ 2 = AiAq satisfies the conditions stated. Moreover, in this basis the quadratic 
form Fa is 

Fa = [A, 0, C], A = tr{auj'l) = a, C = tr(aa;|) = —c 

since tr(au;ia; 2 ) = tT^—ac/VD) — 0. Thus a is a multiminimizer. 

Conversely, if a is a representative of a class in MM(a) then it follows from 
Proposition 10.3 that there exists a unique basis {uii,uj 2 ) of a (up to interchanging 
the uji and changing their signs) of o with respect to which Fa{x) is diagonal, and 
the coset ^ is then i(wi + W 2 ) + a. This also proves the last statement. In this basis 
Fa = [A,Q,C] with A = tr(au;f) and C = tr(aa;|). We choose signs and order the 
basis such that A = W 2 /W 1 > 1 and claim that then 17 (A) < 0, so that A is the root 
of a standard form. In fact, we have a = qj( ijJ\IjJ 2 '/F) for some q € Q+. Thus 
C = tr(q\/\/13) and 

A = tr(q/(A\/B)) = -N(X)C. 

Since Fa is positive definite (by definition of a multiminimizer), the numbers A and 
C are positive, so this implies that N(X) < 0 as claimed. One checks immediately 
that A does not depend on the representative of the multiminimizer in MM(a, ^) we 
have chosen. We take Q = [a, b, c] so that A satisfies aA^ + 6A + c = 0 with a > 0 
and a, b, c coprime integers, and since (I, A) is an invertible Gd module, we then 
have b^ — Aac = D. 

Obviously, the composition Xq i—>■ q;(Aq) i—>■ A(a) is the identity. In the other 
direction, note that a is determined by (u!i,uj 2 ) up to a positive rational number 
and that fi with /r(l, Aq) = a is determined up to a unit. Consequently, each of the 
distinguished basis elements oji is determined up to a unit and a is determined up 
to a square of this unit. Since multiminimizers were defined in (81) by these two 
equivalence relations, this shows the bijection we claimed. □ 

10.2. Cusps of Hilbert modular surfaces. Classically, cusps of the Hilbert mod¬ 
ular surface X]j are defined to be the equivalence classes of points in '^^(K) under 
the action of SL(0^ © 0). Equivalently, we may define a cusp as an exact sequence 

0a'" ^ 0^ © Od ^ a ^ 0 

of torsion-free 0-modules up to the action of SL(0^ © 0) on 0^ © 0 and its sub 
0-modules. The modules a arising in this way are quasi-invertible, i.e. invertible 
0E-module for some order Oe A On = 0- Yet another equivalent viewpoint to 
define a cusp is by the class of an invertible OE-ideal a together with an element 



We briefly recall how to see the equivalence of these definitions. For the equiv¬ 
alence of the first two definitions, intersect the line L C determined by a point 
in P^(it') with a fixed embedding of O'^ © 0 in to get the exact sequence and 
conversely tensor the exact sequence with K over 0. That such an extension class 

is determined by r € Z/y^Z can be deduced from the calculation in [2], Proposi¬ 
tion 7.20 or [3], Theorem 2.1. 

As a preparation for the proof of Theorem 10.1 we determine the Fourier series 
of D9 at a given cusp a of Xd- For a basis a; = (a;i,a; 2 ) of the ideal a let Pi^{x) = 
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{x + to) • for X & 1?, with the dependence on to snppressed in the notation. 
Then for (to, to') G (f^)^ odd we define 

= ^ (-1)^"^'Cr(pc., (x)(83) 

a:GZ2 

where qi = e(zj). Note that T>20(m,m'),c..’ depends on the chosen basis u>, but a base 
change can be compensated for by letting the base change matrix also act on the 
characteristic (to,to'). Consequently, the product 

D29(^rn,m'),ui (84) 

(m,m') odd 

is of the form X^yGo ^od thus invariant under an upper triangular 

matrix in SL(a^ © a). 

Lemma 10.5. The Fourier expansion of T>9 at the cusp a is proportional to T)9a. 

Proof. To avoid the generally hard problem of finding the Fourier development of 
a modular form at a different cusp we use the fact that the vanishing locus of T)9 
has an intrinsic formulation in terms of eigenforms. In Theorem 9.1 we proved that 
this vanishing locus corresponds to the set of principally polarized abelian varieties 
with real multiplication such that the first eigenform has a double zero. This proof 
works for any Siegel modular embedding, for example the one given at the end 
of §5.1, where the locus of real multiplication is Xjj^a and if is constructed with 
the help of the matrix B as in (25) having a; = (wi,a; 2 ) as its first column. The 
restriction of the Siegel theta function with characteristic (to, to') via this modular 
embedding is just To complete the proof, we note that the cusp at oo 

of Xo.a is just the cusp a of Xd- To see this, take a matrix () nSL(2, K). 
It conjugates SL(0^ © 0) into SL(a'' © a), since and the line at infinity 

in P]f intersects a'' © a in the submodule a^, as required. □ 

Proof of Theorem 10.1. We first determine the cusps of the vanishing locus of T)9a^ 
as defined in (84), that map to the cusp oo of the Hilbert modular surface Xjj^a- 
Because functions in a neighborhood of the cusp a have the form Ci/e(tr(i/z)) 
with ly G (a''a )'^ = a^, we can choose a local parameter q at a branch of this locus, 

with g = 0 at the cusp, of the form q = e{z/a) with a G (a^)^ primitive. A lift of the 
vanishing locus to HI x HI looks locally like Z 2 = p(zi), where (/?(z) = ^z + C+ 
as 9 ( 2 ) —>■ 00 for some C G C and some power series £ in g with no constant term. 
Then we have 

e(jzzi + jz'^Z 2 )|iocus = e(C'iz'^) for all v £ , (85) 

so that the restriction of any Hilbert modular form becomes a power series in g. 
Making a different choice of the lifting would change C by an integral multiple of 
N{a)^'/D/a, so that the quantity 

S = e( - G C* (86) 

VlV(a)2/D/ 

is independent of the choice of the lifting. We will show below that S is in fact a 
rational power of an element of K*. 
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The resulting g-exponents after plugging (85) into (83) are of the form iT{apu) (a;)^). 
In order for the theta derivative to vanish, the smallest exponent of q must occur 
twice, so this quadratic form has to take its minimum twice (with x and —x not 
distinguished). Hence a is a multiminimizer for a. Recall that this specifies a 
only up to but here the fact that q = e{z/a) is a local parameter (or 

equivalently, that a is primitive) eliminates the Q(^-ambiguity. The C/|)-ambiguity 
corresponds to the fact that this group (considered as diagonal matrices in the 
Hilbert modular group) stabilizes the given cusp a of the Hilbert modular surface. 
Since acts transitively on the three non-trivial cosets C of ^a/a and since a mul¬ 
timinimizer has the multiple minimum property on exactly one of the three cosets 
by Proposition 10.3, we may suppose from now on that the branch is chosen such 
that for m = vanishes on that branch. 

We next determine a and exhibit for this purpose a convenient basis of a^. We 
suppose that wi, UJ 2 was from the beginning of the discussion the distinguished basis 
of a associated with a multiminimizer in Proposition 10.4. We let g = gcd(a, c), 
where ax^ -I- -I- c = 0 is the minimal polynomial of A = 022 /^ 1 . Choose s,t G Z 

such that sa + tc = g. It is easily verified that 

= {a ,13*), 


where 


a 


5 2 I ^ JO* 

—Wi -I- OJ1UJ2 and p 

a a 


1 

-UJ1UI2 

9 


since gcd(a, 6, c) = I. (In fact, contains -I- ^WiW 2 using the minimal poly¬ 
nomial, so ^a;iW 2 and finally ^u’iU ’2 by the gcd condition. The ideal also contains 
+ ^uiiUJ 2 and, since gcd(a,te) = g, also a*. The converse inclusion follows 
from the line below (89).) This basis is chosen such that the dual basis is {a,/3} 
where 


—ac a b + '/D 

gLjj\\\/15 gtOi 2\/1 j 


(87) 


As local coordinates on the Hilbert modular surface we now pick X = q^ ^ 
and Y = q^ q^^^ \ which are power series in the local coordinate q beginning with 
q and S, respectively. In these coordinates the factors of DOa have the expansion 


where Xi = Xi + ^ and where 


F = 


-, 0 ,— 

L5 9 J 


G = 


—bt, 2g, bs 


(89) 


(This follows from — bt(3* = wj, “Ig P* = 2a;ia;2 and —|a* — bsj3* = u;|.) 
Then has an expansion in integral powers of X and 


but we actually need the product T = D 2 O 1 11 

(( 2 > 2 h( 2 


Doe 


((2 > 2 bfO. 2 


and this 


product, divided by , has integral powers of Y as well. 

Finally we need to show that map from cusps to multiminimizers is onto and 
that the hbers have cardinality g. Analyzing the lowest order coefficient in q of T, 
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to which precisely the summands xi = and X 2 = contribute, and noting 
that G{^, 5 ) — = g, we hnd that the S of ( 86 ) has to be a solution of 


= 



a 


(90) 


This equation has precisely g solutions, differing by (/th roots of unity. For each 
such solution of the lowest order term in q there is a unique power series e{q) such 
that (85) is in the vanishing locus of D6a, since the coefficients of e{q) are recursively 
determined by a triangular system of equations. (We will discuss the arithmetic 
properties of £(< 7 ) in §13.1.) 

This completes the proof for the vanishing locus of VOa at the cusp 00 and by 
Lemma 10.5 also the proof of Theorem 10.1. □ 


A statement like Theorem 10.1 can certainly be proven along the same lines 
also if a is not an invertible Ou-ideal (which can of course only happen for non¬ 
fundamental D). This has an effect in Lemma 10.5, and the fact that gcd(a, 6 , c) > 1 
changes the computation of the basis {«*,/?*} that was used in the proof. Since 
our aim is just to demonstrate the method of reproving the properties of genus two 
Teichmiiller curves using theta functions, we do not carry this out in detail. 


Proof of Theorem 10.2. The reducible locus is the vanishing locus of the product 
of all 10 even theta functions. Thus branches of the vanishing locus have to be 
parametrized as in (85) with a a multiminimizer. For m = (0, 0) the forms are 
never multiminimizing and in all other cases the proof proceeds as the proof of 
Theorem 10.1. Only the coefficients of the equation (90) change, but not the 
exponents. This does not affect the number of cusps for each multiminimizer. □ 


10.3. Cusps of Wd via flat surfaces and prototypes. The cusps of the Teich- 
miiller curve Wd were first determined in [24] based on the following observations. 
It was discovered by Veech along with the definition of Teichmiiller curves that 
cusps correspond to directions (considered as elements of R^/R*) of saddle con- 
neetions (i.e. geodesics for the flat metric |w| starting and ending at a zero of w). 
Applying a rotation to the flat surface, we may suppose that the direction is hori¬ 
zontal, so that the subgroup of GL(2,R) stabilizing the direction consists of upper 
triangular matrices. McMullen discovered that flat surfaces parametrized hy Wd 
always decompose in saddle connection directions into two cylinders as indicated 
in Figure 2. The action of the upper triangular group allows us to assume the 
upper cylinder to be a square, while still having the freedom to normalized by the 
action of T = (q ]). McMullen further deduces from real multiplication that the 
two cylinders must be isogenous with a homothety in a real quadratic field. We 
thus may suppose that (a, c, q) are integral and A is real quadratic. A more careful 
analysis of the isogeny (see [24], Equation (2.1), or [2], Proposition 7.20), impos¬ 
ing moreover that Od is the exact endomorphism ring of a generic abelian surface 
parametrized by the Teichmiiller curve, implies that A = [a, b, c] with a and c as in 
Figure 2 and such that (a, b,c,r) = 1. 

Obviously a > 0, c < 0 and the side length inequality —aX^ < —c is equivalent 
to a + b + c < 0. By the action of T we may reduce mod gcd(a,c). Altogether 
this implies that cusps of Wd correspond bijectively to pairs (A = [a,b,c],q) with 
A standard and q G Z/(a, c), as in Theorem 10.1. 
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-aX^ 



Figure 2. Prototype of a flat surface parametrizing cusps of the 
Teichmiiller curves Wd- Sides with the same label are identified. 


10.4. Computing multiminimizers. There are many variants of how to expand 
real numbers into continued fractions. One may subtract (slowly) one at each step 
or group these steps together, one may subtract one while being greater than one, 
greater than zero, or until becoming negative. We will need two of them, one here 
and one in §11.2. 

The slow plus greater-than-one continued fraction expansion of x G R>i is defined 
hy xq = X and then inductively by 


f Xn-l if Xn > 2 

\ l/{Xn - 1) if 2 > > 1 ’ 


so that 


X = 1 




1 + 


1 + --- + 1 + 1 + ^ 


The class of the ideal [a] = [(l,a;„)] is unchanged under this continued fraction 
algorithm, since (l,a;„) = (l,a;„ - 1) = (a;„ - 1)(1, y;^). 


Lemma 10.6. The slow plus greater-than-one continued fraction expansion of any 
quadratic irrational A is periodic, and is pure periodic if and only if A is standard. 


Proof. The classical proof of Lagrange (for the more usual fast continued fraction 
algorithm with plus sign, where A is sent to A — [AJ if A > 1 and tol/AifO<A<l) 
applies here as well. The slow algorithm here just introduces intermediate steps and 
combines two steps when 1 < A < 2. The second statement is proven in [11]. □ 

This observation gives us an algorithm to compute all multiminimizers for a 
given ideal a. Write a = /io(Ao, 1) and apply the continued fraction algorithm to 
Ao until it becomes pure periodic, say at a standard quadratic irrational Ai with 
a = /xi(Ai, 1). The first multiminimizer is then the Uf, ■ Q*-class of oi = 

as in (82). Proceeding in this way with ak = ^r^/rr where a = /ifc(Afe,l) for 

A 2 ,..., A„ along the period of Ai gives all the multiminimizers for a. 
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In each of the steps we may choose a in its Q*-class to be primitive in some fixed 
0 D-module. We will use this normalization for the Bainbridge compactification in 
the next section. 


11. The Hirzebruch and the Bainbridge compactification 

Hirzebruch’s minimal smooth compactification is a toroidal compactification. 
In this section we reinterpret the Bainbridge compactification of Hilbert modular 
surfaces as a toroidal compactification, see Theorem 11.5. This compactification 
was defined originally via the Deligne-Mumford compactification of the moduli 
space of one-forms on Riemann surfaces, We will see that both Hirzebruch’s and 
Bainbridge’s compactification can be computed using continued fraction algorithms. 

We start with a review of toroidal compactifications from the viewpoint of curve 
degenerations and we recall Hirzebruch’s continued fraction algorithm. The proce¬ 
dure presented in §10.4 to compute multiminimizers is very similar to Hirzebruch’s, 
but the continued fraction algorithms are different. The geometry of Bainbridge’s 
compactification has been described in [2]. We recall this material in §11.4, since it 
is part of the main Theorem 11.5. Once this is proven, all geometric properties will 
follow from general facts about toroidal compactifications. (See Proposition 11.2.) 
The proof of the main theorem will be given in §11.5. 

The description of toroidal compactifications will make it easy to determine 
(in §11.6) where the cusps of modular curves Fm or Teichmiiller curves Wd in¬ 
tersect the cusp resolution cycle both of the Hirzebruch compactification and the 
Bainbridge compactification. Finally, in §11.8 we compare the two continued frac¬ 
tion algorithms governing the two compactifications and give formulas for the total 
lengths or the cusp resolution cycles in both cases. 


11.1. Toroidal compactifications. The reader may consult the textbook by Ash- 
Mumford-Rapoport-Tai ([1]) as general reference on toroidal compactifications. We 
give a self-contained treatment for the case of Hilbert modular surfaces. Locally 
near (oo,oo) G a Hilbert modular surfaces is ^/G{M,V), where M is a com¬ 
plete submodule of K (i.e., an additive subgroup of K that is free abelian of rank 
two) and G(M, V) C SL(0^ © Od) is the semidirect product of M and some sub¬ 
group V of totally positive units (see e.g. [39]). 

We study the limiting behavior of a complex curve C in parametrized by 
r G El and having the asymptotic form 

Zi = JT + Aq + Aiq + A2q^-I - 

„ 2 ( 91 ) 

22 = 7 T + Bo + Biq + B 2 q^ H- 

for 3(r) —>■ oo, where 7 G AT is totally positive, q = e(T) and Ai,Bi G C. If this 
curve descends to an algebraic curve in the Hilbert modular surface A^, so that the 
intersection with the subgroup M of the cusp stabilizer is not trivial, we can always 
assume that the leading coefficients are of that form. In fact, such a curve is always 
given as (part of) the vanishing locus of some Hilbert modular form. Looking at the 
g-expansion of this Hilbert modular form, we deduce (91) by the same argument 
as given at the beginning of the proof of Theorem 10.1. 
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To each totally positive a £ M we associate a copy of P^(C). We have a map 

f IBP/M C* C Pi 




(^ 1 , 2 : 2 ) e (tr(a'^ 2 )) = e 


N{M)Vd ') 


where N{M) denotes the norm of M. Using this identification, we glue Pi at the 
cusp (00,00) of topologically as follows. A sequence of points 2 = (21,22) G 
with imaginary parts yi , y2 both tending to 00 with limiting value of yi jy^ being 
equal to t G M+ converges to the point 0 G Pi if t < aja^, to the point 00 G Pi 
if t > a/a'^ and to a finite point X = e{9/N{M)'/D) for some 0 G C if t = a/a'^ 
and if 0*^21 — a22 = 0 + o(l). 

Consequently, the curve C meets Pi at 00 if tr(a^7) < 0 , at 0 if tr(a^7) > 0 , 
and at a finite, non-zero, point if 7 is a rational multiple of a. 

To any oriented Q-basis (a, (3), i.e. with corresponds a pair of projective 

lines Pi and P^ meeting at one point (00, 0 ) G Pi x P^. Suppose that 7 G AT has 
the property 

0 7/3 

— < — < 

Q,0- ry(y pa 

holds, so that the curve C passes through the point (00, 0 ). If we write ^ = pa+ q(3 
with p, <; G Q+ then, near the point (00, 0 ) the curve ( 91 ) looks like 

{x-y = {XpY. 


We will, of course, apply this in particular to the curves Fjv and to Teichmiiller 
curves. 

A U-invariant partial compactification H^/M of a Hilbert modular surface is 
defined by adding not just one Pi, but an appropriate sequence of Pi’s. 

Definition 11.1. A sequence of numbers (onjneN in M forms a fan if i) the an 
are all totally positive, ii) V- {anjngN = {anjneN and Hi) the ratios afjan are a 
strictly decreasing sequence. 


Our definition includes that the fan is U-invariant. If U = (e), then ii) is 
equivalent to the existence of some k such that ea„ = ctn+k- The minimal positive 
k with that property will be called the length of the fan. 

Proposition 11.2. A fan (an)neN determines a partial compactification IP/M of 
IP/M with the following properties. For each n there is an irreducible curve Pi^ 
in IP/M \ IP/M. The curves Pi^ and Pi„^j intersect in one point. For k > \ 
the curves Pi^ and Pi^^j, are disjoint in H^/M. 

The action of V oniP/M extends to an action on IP/M. Hence a fan deter¬ 
mines a partial compactification IP/G{M,V) oflP/G{M,V). 

The partial compactification is always smooth along Pi^ minus the intersection 
points with the Pi„^j. At the intersection point of Pi^ and Pi^^^ the compactifi¬ 
cation is smooth if and only M = Zun + Zq;„_|_i. More generally, if Zun -f Za„_|_i 
has index k in M, then P? /M has a cyclic quotient singularity of order k at that 
point. In particular, the compactification is normal. 


Proof. We first recall the general setup of two-dimensional toric varieties, e.g. the 
book of Fulton [14] provides an introduction to this topic. For each two-dimensional 
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cone a given, say, as the span of an and On+i we let Ua be the variety with 
coordinate ring flM^]. For each one-dimensional cone t at the boundary of 02 , 
say spanned by a„ we let Vr dUahe the variety with coordinate ring C[(t^ 

If r is contained both in ai and 02 we may glue and along the open set Vr. 

In particular a„ (or t = (an)) determines a rational curve = (C/o-i U \ Vr- 
The zero-dimensional cone 0 corresponds to the variety W with coordinate ring 
C[M^] and sits as open part in all the Vr and Va. Here is the dual Z-module, 
which we will identify from now on as a submodule of K using the trace pairing. 

In our situation we want to identify W with C^/M. On the level of local coor¬ 
dinate rings this is done by assigning to 5 G the coordinate function 

{zi,Z 2 ) = e{bzi +b'^Z 2 ). 

(This is the same coordinate as X^v associated above with a G M, using the 

identification of M with via a ^ a^ = —We refer to a"^ as the trace 

n{m)Vd 

dual of a.) Since H^/M sits inside C^/M, the partial compactihcation of W by the 
Ua defines the desired partial compactihcation of H^/M. 

Given a cone t generated by a„, the element a)( is the unique (up to sign) 
primitive element of with tr(a„Q;)() = 0. If we complete a)( to a basis of 
using /3„, whose sign we may choose such that tr(a^/3„) > 0, then the curve 
{X^^ = 0} is independent of the choice of /3„ and coincides with the curve 
dehned in the text preceding the proposition. 

Given two consecutive elements and a„+i of the fan, the monotonicity of the 
ratios implies that 

tr(a^Q;„+i) < 0 and tr(a^_i_;^a„) > 0. 

Hence a)) and can play the role of /3„ above. Gonsequently, the intersection 

point of P^^ and is the point = 0, = 0}. Since Xa^ is a 

coordinate on , we retrieve that the intersection point of P)j^ and P^^^^ is (00, 0) 
in that coordinate system. 

The disjointness of P)^^ and Pa„_^j, is obvious from the gluing procedure. The 
statement on the H-action is an obvious consequence of the H-invariance of a fan. 
The singularity statements are described in detail in [14], Section 2.2. □ 

Replacing the Q-basis (a, (3) by {a + (3, (3) resp. by (a, a + (3) corresponds to 
performing a sigma-transformation (or blowup) defined by the coordinate changes 

{X-\Xp)e^{X-^X/,Xp) resp. {X-\Xp) {X-\X^^Xp) 

at that point. It follows that the partial compactifications defined by any two fans 
are related by repeated blowing up and blowing down. 

11.2. Hirzebruch’s compactification. We describe the fan of Hirzebruch’s com- 
pactification. There are many detailed expositions of this, in particular [17] and 
Chapter 11 of [39]. The aim is to compare the fans of Hirzebruch and Bainbridge 
below. We will use the letters Ak for Hirzebruch’s fan (which is consistent with [39]) 
and subscripts k for the indexing that produces an increasing sequences of slopes. 
(Thus if we replace An by Ak^-n for some /co we fit exactly the Definition 11.1.) 

Suppose we want to compactify the Hilbert modular surface at the cusp a 
or equivalently the cusp at 00 in Xo,a- Then, in the notation of §11.1 the module 
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M = a'^(a^^) = (a^)^- We choose the Ak to be the set of extreme points in 
M+ = M n (M+)^, i.e. the points lying on the convex hull conv(M+) of M+ in 
(]R+)^, indexed by increasing slope then the Ak form a fan since conv(M+) is V- 
invariant. This compactification is smooth, because any two adjacent points on the 
boundary of conv(M+) form a Z-basis of M ([17] or [39], II) Lemma 2.1). Since 
the Ak lie on the boundary of the convex hull, we can write 

Ak-i + Ak+i = PkAk with pfe G Z, p > 2 . (92) 

On this smooth compactihcation the self intersection number of is —pk 
([39], § II.2). Consequently, the compactification using at a cusp with stabilizer 
G{M, V) the fan given by the boundary points of conv(M+) is the minimal smooth 
compactification. 

We call X reduced (or a reduced quadratic irrationality) (see [39], Section 2.5 or 
[45]) if X is real quadratic and 


a; > 1 > > 0. (93) 

The fast-minus continued fraction algorithm of a; G M > I is defined by a;o = a; and 
then inductively by 

Xk+i = 1/iPk-Xk), where pk = [aifc] , (94) 


so that 

1 

— Pk ^ 

Pk+1 - 

Note that the narrow class of the ideal [M] = [(I, a;)] is unchanged under this 
continued fraction algorithm, since (l,x) = (p — a;)(l, The following lemma 

is the analogue of Lemma 10.6 above. 

Lemma 11.3. The fast minus continued fraction expansion of any quadratic irra¬ 
tional X is periodic, and is pure periodic if and only if x is reduced. 

Proof. This is well known and is stated, for instance, in §2.5 of [17], where the 
relation of the fast plus continued fraction expansion and the fast minus continued 
fraction expansion is also given. □ 

This observation gives us an algorithm to compute the convex hull. Write M = 
/io(l) xq) and apply the continued fraction algorithm until it becomes pure periodic, 
say at a reduced quadratic irrational xi with M = /xi(l,xi). Take Aq = pi and 
Ai = piXi and then Ak = Ak-ijxk for k > 2, where xi,X 2 ,... ,Xn form the cycle 
of the continued fraction algorithm. The recursive definition of the Xk in (94) then 
is equivalent to (92). 

Conclusion: The Hirzebruch compactification is given by the fan stemming 
from the lower convex hull or, equivalently, triggered by the fast minus continued 
fraction algorithm. 
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11.3. Bainbridge’s compactification. Bainbridge’s compactification of a Hilbert 
modular surface is defined using the Deligne-Mumford compactification of the mod¬ 
uli space of curves of genus two. For the details we recall facts about various bundles 
of one-forms over moduli spaces. 

In Section 5.3 we introduced together with Teichmiiller curves the vector bundle 
HMg of holomorphic one-forms over the moduli space Mg of curves of genus g. The 
moduli space of curves comes with the Deligne-Mumford compactification by stable 
curves Mg and the vector bundle HMg extends to a vector bundle HMg, whose 
sections are stable forms. A stable form is a differential form on the normalization 
of the stable curves, holomorphic except for at most simple poles at the preimages 
of nodes and such that the residues at the two branches of a node add up to zero. 
We refer to DMg and to the corresponding projective bundle PDMg as the Deligne- 
Mumford compactification of DMg (resp. of PHMg). 

Strictly contained between Mg and Mg is the partial compactification Mg of 
stable curves of compact type, i.e. stable curves whose Jacobian is compact or 
equivalently of arithmetic genus g. 

For 5 = 2 the Torelli map t : M 2 —>■ Ai is an isomorphism. It extends to an 
isomorphism of the bundles of stable one-forms t : HM 2 —>■ VtA 2 and also to the 
projectivized bundles t : PHM 2 —>■ TVlA^- 

On the other hand, we have seen in Section 5.1 that a Hilbert modular surface 
parametrizes principally polarized abelian varieties with real multiplication. There 
is a unique holomorphic one-form on such an abelian variety that is an eigenform 
for the action of real multiplication (with the embedding K ^ M. that we fixed 
throughout). The quotient map of a Siegel modular embedding by the action of 
SL(2, Od) defines a map X]j —>■ A 2 and the choice of an eigenform lifts this map 
to an injection 

ip Xjj — y PHA2. 

We thus use the same letter for this map as for the modular embedding. The image 
t~^{'p{Xu)) is called the eigenform locus (maybe the projectivized eigenform locus 
would be more precise). It parametrizes stable curves of genus two of compact type 
with real multiplication by Oo. We denote hy Xd the closure of the eigenform 
locus in the Deligne-Mumford compactification PDM 2 . 

Definition 11.4. The Bainbridge compactihcation (called the geometric compacti¬ 
fication in [2]) Xd^ of a Hilbert modular surface Xjj is the normalization of Xd^^ . 

We now identify this compactification as a toroidal compactification. We restrict 
to the case D fundamental for simplicity and since the preparations in Section 10 
have been carried out for this case only. In §11.5 we will prove: 

Theorem 11.5. Suppose that D is a fundamental discriminant. For each cusp 
given by the ideal class a the sequence of multiminimizers for a, as derived from 
the continued fraction algorithm in § 10. f, forms a fan. The Bainbridge compact¬ 
ification is the toroidal compactification of the Hilbert modular surface obtained by 
using this sequence of multiminimizers. 

Comparing with §11.2, we can summarize this theorem and the algorithm in 
§10.4 as follows. 
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Conclusion: The Bainbridge compactification is given by the fan stemming 
from the multiminimizers or, equivalently, triggered by the slow plus greater-than- 
one continued fraction algorithm. 

In §11.4 we review the properties of the compactification by Bainbridge. For the 
proof of Theorem 11.5 we will need only part of these properties. The local structure 
at the intersection points of the compactification curves is forced by normality and 
could also be derived from Proposition 11.2. 


11.4. Period coordinates and properties of the Bainbridge compactifi¬ 
cation. In order to identify the Bainbridge compactification and to prove Theo¬ 
rem 11.5 we use the part of the work of Bainbridge, where he gives a coordinate 
system of OM 2 and describes Xd in there. He uses a lift by choosing a scaling 
of the one-form uj by fixing locally a loop ai and by imposing that to = ri 
for some ri G K. We will then compare the coordinates introduced for toroidal 
compactifications to this coordinate system and thereby prove the claimed isomor¬ 
phism. 

There are two relevant types of coordinate systems, both called period coordinates 
depending on the type of the stable curves. We follow [2], Section 6.6. If {X,uj) is 
in the boundary of Xo , then g{X) is zero. The first type of coordinate system 
is around a stable curve X with two non-separating nodes. We moreover suppose 
that UJ has two simple zeros. We let ±ri and ±r 2 be the residues of w. Choose 
loops Q;i,a 2 around the punctures such that f uj = ri. For a smooth surfaces 
in a neighborhood choose loops /3i, (32 that complete the ai-curves to a symplectic 
basis. Finally, choose a path I joining the two zeros of uj. Then on a neighborhood 
of {X,uj) in HM 2 the functions 



are well-defined, i.e. independent of ambiguity in the choice of Pi given by Dehn 
twists around the corresponding ai. These five functions form a system of coordi¬ 
nates on HM 2 . We provide them with a superscript E (edge) to distinguish them. 
They will correspond to edges of the boundary of Xd^^. Note that x = is 
only well-defined up to an additive constant depending on the path of integration 
and that its sign depends on the choice of ordering of the two zeros. 

From the geometry of the Deligne-Mumford compactification Bainbridge derives 
the following proposition that we use to prove Theorem 11.5. 


Proposition 11.6 ([2], Proposition 7.18, Theorem 7.17 and Theorem 7.22). A pair 
(ri,r 2 ) appears as residues of an eigenform of Xjj if and only if X = r 2 jri € Q{'/D) 
and N{X) < 0. More precisely, the irreducible components of the boundary of 
Xu are in bijection with the unordered projective tuples (ri : r 2 ), or equivalently 
to standard quadratic irrationals A in Ou- 

Near a boundary component labeled by X, the Hilbert modular surface Xu is 
cut out in HMg by the equations 


= 


(B) = 


ri, w 


r 2 and 


(96) 
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and the boundary curve is given in these coordinates as = 0 }. 

The boundary of Xd is a union of rational curves C\ where X = rxjr^- 

We explain the last statement. If gcd(a, c) > 1, then (96) shows that the com- 
pactification is not normal near {y^^^ = = 0 } : the normalization has gcd(a, c) 

local branches. Nevertheless, the preimage of = z^^'^ = 0} in the normal¬ 

ization Xu is a connected curve C\. The normalization map is a cyclic covering 
of order gcd(a, c) ramified precisely over the intersection points of C\ with its two 
adjacent curves in the cusp resolution. 

We include the following proposition for a complete description of the Bainbridge 
compactification. It is not needed for the proof of Theorem 11.5. 

The second type of coordinate system is around a stable curve X consisting of 
two irreducible components joined at three non-separating nodes. Here we choose 
ai,a 2 ,ex^ to be the loops around the punctures and let Cj = cv be the residue. 
We may orient the ai so that ri — r 2 -I- ra = 0. The one-form oj necessarily has one 
zero on each of the irreducible components. We let 7 i be a path joining these two 
zeros that crosses ai once with positive intersection and no other aj. Then the five 
functions 

= [ uj, = f OJ, = e( f OJ/ 

J Ol\ J 0c2 73 

y^^'^ = e = e oj/w^^^'^ (97) 

form a system of coordinates on HM 2 near {X,oj). We provide them with a super¬ 
script V (vertex of the compactification cycle) to distinguish them. We also have to 
say how the coordinates , ■ ■ ■, z^^^ and , • ■ •, z^-^^ are related near a vertex. 
Suppose we unpinch the node corresponding to 03 . Then on this nearby surface 

/3i = 71 - 73 , /32 = 72 + 73 , = 73- 

Consequently, we have 

= v^^^ =: V, =: w 

and 

/{w — v)), y^'^^ = y^^^ ■ /v), z^'^^ = z^^^ ■ /w) . 

The following proposition describes the intersection points of boundary curves 
of Xo ■ It is proven in [2], Theorem 7.27 but it is also an immediate consequence 
of Theorem 11.5 and Proposition 11.2, in particular its last statement. 

Proposition 11.7. If X'^ denotes the successor of X for the slow plus greater-than- 
one continued fraction, then the curves C\ and C'>+ have exactly one point c\ in 
common. Near this intersection point the Hilbert modular surface Xjj is cut out 
in HMg by the eguations 

y(v) _ yj(v) _ ^ ^ 

The point c\ is a cyclic quotient singularity of order 

a 

gcd(a, c) gcd(a, a -I- 6 -I- c) 


toa = 
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11.5. The proof of Theorem 11.5. The first step is to show that the multimin- 
imizers form a fan. For the cusp a we may choose the normalization as in (87) 
and suppose that the multiminimizers are primitive elements in (a^)^. We next 
examine the slopes. Suppose that at some step of the slow plus greater-than-one 
continued fraction we have the representation a = /ifc(Afc, 1). Then 

g(afc) ^ Afc/Xfc , <j{ak+i) ^ (Afc - 1)^1 

ak ^ ctk+i cr((Ak - l)/i|) 

in both cases of the continued fraction algorithm. The ratio of these two fractions 
is 


^ Cl 1 Wj _ 1 

ak ak+1 V Afc —1/V cr(Afc) 

since Afc is standard. Consequently the slopes of multiminimizers are decreasing. 
Since the continued fraction algorithm is periodic this shows that the sequence 
multiminimizers in continued fraction order forms a fan. We denote the compact- 
ification of the Hilbert modular surface using these fans of multiminimizers by 

—^MM 

Xd 

Ad Ad, * B , * D Ad,^ 

We let Affcj , Xd , resp. Xd be the complement of the intersection 
points of the cusps resolution curves in the three compactihcations, i.e. obtained 
by removing the codimension two boundary strata. Our aim to show that there 

-MM,* -DM,* 

is a map m : Xd Xd , which is an isomorphism over Xd and which 

maps the boundary components labeled by a standard quadratic irrational A onto 
the component with the same label by an unramified cyclic covering of degree 
g = gcd(a,c). Since the Bainbridge compactification is normal, the map m factors 

x-v Ad Aid, * B , * 

through a map m : Xd —>■ Xd , which is an isomorphism by the local de- 

. . r 1 “t7— B^* — Dlid,* . (-> • 

scription of the map Xd Xd ■ Since the multimimmizer compactihcation 

and the Bainbridge compactification are normal, the codimension two indetermi¬ 
nacy of TO (on domain an range) can be resolved to a global isomorphism. 

We want to define to in a neighborhood of a point on the component of the 

-MM,* 

boundary oi Xd given by the multiminimizer a with corresponding standard 
quadratic irrational A. Since is the unique (up to sign) primitive element in 
(a^)^ with tr(aQ;'^) = 0, local coordinates near this point as defined in the proof of 
Proposition 11.2 are just the coordinates X = q 2 ^°‘ ^ and Y = qf q^^^ ^ used in 

the proof of Theorem 10.1. 

The map to is given by assigning to a point va Xd the curve given as the vanishing 
locus of the Siegel theta function restricted to Xd together with the first eigenform. 
In order to understand the local behavior of this map near the boundary we may 
choose any convenient translate of the theta function. As in the previous proofs 
we take the characteristic ((|, 5 ),( 5 , 0 ) and the basis a; = (a;i,a; 2 ) of a that is 
distinguished by the multiminimizer. Moreover, we define elliptic coordinates 



S'= e(tr(a;)^u)), T = e(tr(w 2 u)) • 
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In the coordinates X, Y, S, T, the theta function is 

^ (-l)2^'(rn')"'js:F(£i,5l2)/2yG(2i.S2)/25.xiy^2 (gg) 

= yG(i-i)/2(.^ - T) + 0(X)^ , 

where Xi = Xi + ^ and where 

F = [a/g,0,-c/g], G = [-bt,2g,-bs\ . (99) 


Consequently, since the boundary is given by X = 0 the vanishing locus of 9 
degenerates there. As in §6.2 the limiting curve is a rational curve. We abbreviate 
Z = y®( 2 ’2)/2^yG(2.-2)/2 _ y9/2^ obtain (for Z fixed) the equation 


Z-^{ST -1) + {S -T) = 0 or, equivalently {S - Z){T + Z) = 1 - 


for the rational curve. A parametrization with coordinate t is given by S' = t + Z, 
T = (1 - Z‘^)lt- Z. 

On a boundary component of the genus two Deligne-Mumford compactification 
parametrizing rational curves with two nodes, the cross-ratio of the four points is 
a coordinate. This is also the relative period xe that was used by Bainbridge as 
coordinate (cf. Proposition 11.6), up to a Mdbius transformation depending on ri 
and 72, as one checks easily be integrating the limiting stable form. 

Our last task is therefore to express the cross-ratio in terms of the coordinate X. 
The first eigenform is 


UJ = dm = = 


{u:Xr 


,_ 

t -I- Z t(l — 


dt. 


( 100 ) 


in the given rational parametrization. The cross-ratio of the four poles 0, oo, —Z, 
and is given by a Mobius transformation of Z“2 = ygcd(a,c)^ This completes 

the proof of the properties claimed about the map m, and of Theorem 11.5. 


11.6. Intersection of with Bainbridge’s and Hirzebruch’s boundary. 

We can now answer the question about the intersection of the modular and Teich- 
miiller curves with the two compactifications. For the Hirzebruch compactification 
the statement in Theorem 11.8 is of course already in the literature. 

Fix the cusp to be infinity and let F^ be one of the modular curves, as defined 
in §5.2, passing through the cusp at infinity. Then F^ can be given by an equation 
of the form 


Azi -b A'^Z2 + B = Q (B g Z, a G 0, AA*^ = -N). (101) 

The following result now follows immediately from Proposition 11.2 and the de¬ 
scriptions of the Hirzebruch and Bainbridge compactifications given in this section. 

Theorem 11.8. The curve Fpj passes through an interior point of the eusp reso¬ 
lution cyele of the Bainbridge compactification (resp. of the Hirzebruch compactifi¬ 
cation) if there is a multiminimizer q;„ for this cusp such that oinl\ G Q (resp. if 
there is an element A„ of Hirzebruch’s lower convex hull fan such that AnfX G Q). 
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Otherwise, if 


a. 


A 

> ^ > 
a" \'^ 


Q^n+1 


a. 


resp. 


n+l 


An 


A. 


A 

A 


n+l 


n+l 


then the curve Fpf^ passes through the node corresponding to the intersection of the 
curves associated with a„ and an+i (resp. with + and An+iJ. 


This result together with Theorem 10.1 and Theorem 10.2 reproves from the 
theta viewpoint another result of Bainbridge. 

Corollary 11.9. The curves Wd and the components of the reducible locus Pd 
intersect the boundary of the Bainbridge compactification only in interior points of 
the boundary curves. 


Note that the component Pd,u of Pd as defined in Proposition 5.1 is given by 
equation (101) with A = i^VD, B = 0. 


11.7. Examples. Case D = 17. We consider the cusp at oo for SL(0^ © 0). In 
this case M = and the standard quadratic forms are [2,—3,-1], [2,—1,-2], 
[1, —3, —2],[1, —1, —4] and [1,1, —4]. The following table contains the corresponding 
multiminimizers a £ Of,, scaled by the factor y+f. 


standard 

n 

A 

VUa 

[1,-3,-2] 

1 

3+vT7 

2 

-s+vTf 

2 

[1,-1,-4] 

2 

1 + vT7 

2 

-i+VTt 

2 

[1,1,-4] 

3 

-1 + vT7 

2 

i+VTf 

2 

[2,-3,-1] 

4 

3+vT7 

4 

3+vTf 

2 

[2,-1,-2] 

5 

1 + vT7 

4 

4 +v+7 

VUAk 

-3+©17 

2 

-1+©17 

2 

1 + ©17 

2 

3+©17 

2 

4 +v+7 

Xk 

7+vT7 

4 

9+vT7 

8 

7+vT7 

8 

5+vT7 

4 

5+vT7 

2 

k 

5 

4 

3 

2 

1 


By the singularity criterion Proposition 11.2 this compactification is smooth. 

A reduced quadratic irrationality is a;i = (5 + vT7)/2. Its fast minus continued 
fraction is listed in the lower part of that table. The point Ai = -^^(4 + vTf) 
lies on the lower convex hull of 0^^ H (K.+)^. By the algorithm for Hirzebruch’s 
compactification the subsequent points on the lower convex hull are defined by 
Ak+i = Akjxk, as listed in the table. 

The Bainbridge compactification has no singularities at the points c\ and the 
number of reduced quadratic forms equals the number of standard quadratic forms. 
Hence the Hirzebruch and the Bainbridge compactification coincide in this case. 
This is consistent with the table listing the same values for vTfcin and for vTfAfe. 
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Case D = 41. There are 11 standard and also 11 reduced quadratic forms. But 
here the Hirzebruch and the Bainbridge compactification do not coincide. 


standard 

n 

An 

H 

1 

H 


[2,1-5] 

8 

-1+vAi 

4 

83+13VIT 
_2_ 

1 


■\/4TAfe 

Xk 

k 


1 


6 + V^ 

9+V4T 

10 

5 


1 

826 -k 129V4T 

11+V4T 

10 

1 


At the intersection points of the curves C\^ and C\^ and also at the intersection 
points of Cvg and C\g the Bainbridge compactification is smooth, but 

ar + ag = as and also ag + an = aio 

and hence this compactification is not minimal. In fact, C\g (and also C\^g) is a 
(—l)-curve and the corresponding values as and aio do not show up in the list of 
A/c- On the other hand, at the intersection point of and C^y the Bainbridge 
compactification has a quotient singularity of order two, since ag and generate an 
index two subgroup of It can be resolved by blowing up, adding a (—2)-curve, 
corresponding to the value VllAg = 6 + -x/iT that does not show up in the list of 
•\/4Ta„. The singularity can also be read off from the quadratic form [4, 5, —I] and 
Proposition 11.6. In terms of the convex hull of the multiminimizer fan has 
two interior points and skips two boundary points, as is shown in Figure 3 below. 

11.8. Relating the two compactifications and two continued fraction al¬ 
gorithms. The preceding examples show that the Bainbridge and the Hirzebruch 
compactification may sometimes agree, but that they are different in general. They 
also illustrate the general algorithm how to go from Bainbridge’s compactification 
to Hirzebruch’s: blow down curves corresponding to interior points and blow up 
points where boundary points of the lower convex hull have been omitted by the 
multiminimizers. 

At first glance the previous examples suggest that at least the length of the 
boundary cycles in the Bainbridge and the Hirzebruch compactification agree. This 
is true for class number one, but otherwise the truth is more subtle, as we now 
explain. 

To determine the geometry of the cusp a in the Bainbridge compactification, we 
need to run the multiminimizer algorithm given in §10.4 for A such that a = (1, A) 
and get as an output a G (n^)^. The geometry of Hirzebruch’s minimal smooth 
resolution, however, depends only on the square of a^, since we need to determine 
by the algorithm in §11.2 the lower convex hull of (a^)^ H 

Consequently, whenever the squaring map is not an isomorphism on the ideal 
class group of K = Q(\/D), only the reduced quadratic irrationalities x such that 
((l,x))^ is a square appear as label in the cusp resolution of the Hilbert modular 
surface in the focus of our interest. However all the curves labeled by standard 
quadratic forms appear on Ad. An example to illustrate this is D = 65, where 
the class group (both in the narrow and wide sense) is of order two, so the multi¬ 
minimizers for both cusps lie in an ideal equivalent to Oq^. Representatives of the 
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Figure 3. The multiminimizer fan (thick black points) and the 
Hirzebruch fan (circles) for D = 41, connected by red respectively 
by black lines. The rightmost thick black point is an interior point 
of the convex hull. Another point of the Hirzebruch fan skipped 
by the multiminimizer fan is not drawn (far to the right, close to 
the horizontal axis) 


multiminimizers are 


MM(065) 

Z:^MM((1,^^J^)) 


f 5+V65 7+V65 23+3V65 8+V65 153+19^65 
1 2vdi5 ’ 2^65 ’ 2^65 ’ V65 ’ 2^65 ’ 

137+17%/65 395+49%/65 1685+209%/65 1943+241%/65 
2\/65 ’ 2V^ ’ 2\/^ ’ 

f 5+\/65 7+y/^ 8+\/65 137+17\/65 
1 2v1i5 ’ 2\/65 ’ \/65 ’ 2\/65 

395+49vdi5 653+81vdi5 911+113^65 1 
2^65 ’ 2^65 ’ 2^65 J 


while the elements on the convex hull, up to multiplication by , are given by 


f 5+%/65 7+%/65 8+%/65 137+17%/65 395+49%/65 653+81%/65 

1 2v^ ’ 2V^ ’ VoE ’ 2^65 ’ 2^65 ’ 2^65 ’ 

911 + 113\/65 1169+145\/65 1427+177\/65 1 
2\/65 ’ 2 ’ 2 / 


There is however an equality of total lengths of cusp resolution cycles, if one 
takes all Hilbert modular surfaces ]HI^/SL(b © 0_d) into account. We start our 
considerations on the level of quadratic irrationalities. 
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Reduced quadratic irrationalities as defined in (93) are well-known and have 
been used to label the boundary curves of Hirzebruch’s compactification. Standard 
quadratic irrationalities as defined in (80) have been used to label the boundary 
curves of Bainbridge’s compactification. There is an obvious bijection between these 
two classes, given by 


reduced 


X- ^ 

^ A-1 


standard 

a: > 1 > > 0 

\ 

^ x-1 

} 

A > 1 > 0 > A'^ 


In order to pass from this correspondence of quadratic irrationalities to cusp 
resolutions, we define 

C'®*‘^(a) = {{p,,X) G /U'^ X K ■. a = /r(l,A), A standard} 

and 

C"'®‘^(a) = {{p,x) G K'^/Uh X K : a = p{l,x), x reduced}. 

Then the map 

^ ^ ^ ^ 
is obviously a bijection. 

Proposition 11.10. For any a, the length of the cycle of the Bainbridge compact¬ 
ification for the cusp corresponding to is egual to the length of the cycle of the 
Hirzebruch compactification for the cusp corresponding to a. 

Proof. Curves in the Bainbridge compactification for are in bijection to C'®*‘^(a^) 
by Theorem 11.5 together with Proposition 10.4. Curves in the Hirzebruch com¬ 
pactification for a are in bijection to C'’'®‘^(a^) by the algorithm in §11.2. We now 
can use the identification (jj. □ 

This explains the initial observations for D = 17 and D = A1 and in general for 
a = = Od- For a more symmetric formulation we recall that the isomorphism 

class of the Hilbert modular surface ]HI^/SL(b©OD) depends on b only up to squares 
of ideals and multiplication by a totally positive element in K. Consequently, there 
are 2*“^ = |Ker(Sq : Cl’^(D) —>■ C1’^(D))| Hilbert modular surfaces for a given D, 
where t is the number of distinct prime factors of D. This also implies that every 
ideal class a (in the wide sense) appears 2*“^ times as the module M associated 
with a cusp IF?/G{M, V) on the total collection of Hilbert modular surfaces for a 
given D. Altogether, this implies that the number 

E 1 

[a,b,c] standard 
—4ac=D 

appears in two incarnations. 

Proposition 11.11. The total number of boundary curves of the Bainbridge com¬ 
pactification of Xo is equal to i{D). 

The total number of boundary curves of the Hirzebruch compactifications of 
the Hilbert modular surfaces ]HI^/SL(bi © On) for bi in a set of representatives 
o/e/+(£>)/Sq(ei+(il)) is equal to 2*-H{D). 


e{D) = E 1 = 

[a,b,c] reduced 
b^ —4ac=D 





THETA DERIVATIVES 


77 


We could presumably make this statement even more symmetrical if we com- 
pactified all Hilbert modular surfaces ]HI^/SL(b©0_D) using multiminimizers defined 
by theta functions, but we leave it to the reader to explore this. 


12. UNIFORMIZATION and DISJOINTNESS FROM THE REDUCIBLE LOCUS 

The aim of this section is to give an independent proof of ‘Wd is a Teichmiiller 
curve” using the definition via theta functions and applying Theorem 5.3. We 
emphasize that we derive all properties of Wd ah ovo, i.e., using just the defini¬ 
tion (76) as the vanishing locus of D6 and without using anything that follows from 
the “geodesic” definition. Here again we simplify our counting task by restricting 
to fundamental discriminants. 


12.1. Transversality of Wd to the foliation Ti. Recall that in §5.3 we defined 
the “first Hilbert modular foliation” Ti of the Hilbert modular surface Xd to be 
the foliation defined by the constancy of the first cooordinate in the uniformization 
(i.e., by the equation dzi = 0, which is invariant under the action of the Hilbert 
modular group). Let : H —>■ H be a holomorphic map such that 2 ; 1 —>■ (z,ip{z)) 
defines a branch of the vanishing locus of T>9, and suppose that the corresponding 
component Wd of the vanishing locus has the uniformization Wd = H/T. 

Theorem 12.1. Suppose that D is a fundamental discriminant. Then the restric¬ 
tion to Wd of the derivative T)^6 = is a ip-twisted modular form of weight 

(3, 11 ) for r that vanishes only at the cusps of Wd- 


Since the vanishing locus of a holomorphic function F is transversal to Ti at a 
point p if and only if -^F{p) ^ 0 , the theorem immediately implies the transver¬ 
sality we want to prove. 

For all (m,m') G ( 5 ^^)^ and for any basis uj = ( 011 ,^ 2 ) of an 0-ideal a, define 


Dl 




u=0 


where the Siegel modular embedding is defined using a;. If we drop the index 
u) we tacitly assume that we have chosen some basis of 0, as we did in (78). It 
follows that 

l&{m,m'){zi,Z2) = — (^1 , ^ 2 )) 

(by the heat equation or by direct computation). Consequently, 




D^e{zi,Z 2 ) = ^ ( 21 , 2 : 2 ) £)26'(m.m')(2l,^2), 

(mo,m^) odd (m,m') odd 

explaining also the name given to this twisted modular form. 


Proof. By applying the chain rule one sees that the restriction of the z^-derivative 
of a Hilbert modular form to the vanishing locus of the form satisfies a modular 
transformation property with respect to the subgroup stabilizing this vanishing 
locus. This calculation also shows that the z^-derivative increases the weight in the 
fth component by two, proving the first claim. 

To prove the second claim we will show that the vanishing orders of D^9 at 
the cusps sum up to the total vanishing order of a twisted modular form of this 
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bi-weight. Since D^0 is holomorphic, it cannot then have any zeros at finite points. 
On a minimal compactification of the Hilbert modular surface Xd the number 
of intersection points of two modular forms of bi-weights {ki,£i) and (fc 2 ,f? 2 ) is 
j{kii 2 + k 2 £i)\xi^D)\- This follows e.g. from [39], Section IV.2. This calculation 
is still valid when intersecting the vanishing locus Wd oi & modular form with a 
section of the bundle of modular forms of bi-weight (^ 2 ,^ 2 ) to Wd- Since here 
(fci,£i) = (3,9) and (^ 2 ,^ 2 ) = (3,11), the function T)^0a has 15 |x(V_d)| zeros on 
the closure oiVVo- We have to show that they all lie at the cusps. 

Note that precisely one of the two quadratic forms [a, b, c] and [—c, —5, —a] of 
discriminant D with a > 0 and c < 0 satisfies the additional condition a -I- 6 -b c < 0 
required to make it a standard quadratic form, since a + b c = 0 would imply 
that H is a square. From (34) we consequently deduce that 

15x(Vd)| = ^ ^ a = ^ (a+|c|). 

D=b^ —4ac [ajb.c] standard 

a>0, c<0 D = b^ —4ac 

To complete the proof it thus suffices to show that at each of the g cusps oiWo 
corresponding to A, the vanishing order of 'D^O is at least (and hence precisely) equal 
to (a + \c\)/2g. Here A is a zero of the standard form [a, 6, c] and g = gcd(a, c). 
(The order of zero may indeed be half-integral, in accordance with the fact that T)9 
is a modular form with a quadratic character.) 

By Lemma 10.5 the vanishing orders of at the cusps of Wd mapping to a 
can be computed as the vanishing orders of 

®^6'a(zi,^2) = Y, -D|6»(„q,^/^)^^(z1,Z2) D2B{jn,m'),u>{zi,Z2), 

(mo,mQ)odd (m,m')odd 

on the Hilbert modular surface XD.a at the cusps mapping to infinity. By Theo¬ 
rem 10.1 we associate a multiminimizer a to such a cusp and we may assume that 
tjj = (wi, UI 2 ) has been chosen to be the distinguished bases for this multiminimizer. 
If t is a local parameter of a cusps of Wd, as in the proof of Theorem 10.1 then the 
terms appearing in the expansion of T* 2 f*(mo,m^),w and are ^ 

where Xi = Xi + rrii with Xi G Z and where 



as in (89). The minimal t-exponents are greater or equal to (a-|-|c|)/ 8 ( 7 , a/8g, |c|/ 8 g 
in case m is equal to (i, i), (|, 0 ) and ( 0 , i) respectively, both for 020 and D^B- 
Since for each of these m there are precisely two m' such that (m,m') is odd, the 
total vanishing order is at least (a -I- \c\)/2g^ which is what we wanted to show. □ 

12.2. Disjointness of Wd from the reducible locus (by counting zeros). 

In this and the following subsection we give two completely different proofs of the 
following result, which is the second half of what we need to apply the criteria of 
Theorem 5.3 and show that the vanishing locus of D9 is a Teichmiiller curve. The 
first proof is similar to the one used for Theorem 12.1, by comparing the number 
of known zeros of a twisted modular form with its total number of zeros. 

Theorem 12.2. The vanishing locus Wd ofD9 is disjoint in Xd from the reducible 
locus Pd- 
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Proof. Recall that the reducible locus is the vanishing locus of the product of all 10 
even theta functions. This product is a Hilbert modular form of weight (5, 5), so its 
restriction to Wd is a modular form for Wd of bi-weight (5, 5). As in the preceding 
proof we deduce that the degree of its divisor (on a compactification of Wd) is 
i(5-3 + 5-9)|x(Ac)| = 15|x(Azr)|. 

As in the preceding proof it suffices to show that the restriction of the product 
T\(mTa')ewen^(m,m') to Wd vaffishes at each of the g cusps of Wd corresponding 
to A to the order at least (a + \c\)/2g. Here again we can work at the cusp oo of 
Xo.a- There, by the same argument as in Lemma 10.5, the product of the ten even 
theta functions is given by Y[(m m')even^{'m,m’),un where lj is some basis of a and 
where 0(m,m').ixii^) = V'<.o(z)) with the modular embedding • 04 ^ defined 

using uj. The rest of the proof proceeds as above. To each cusp we associate its 
multiminimizer and take uj to be the distinguished basis. The terms appearing 
in the expansion of 9(rn,m'),ui at such a cusp are ^ where Xi = Xi + rrii 

with Xi G h and where F = ^,0, ^ . The minimal t-exponents are greater or 

equal to (a + \c\)/Sg, a/8g, \c\/8g and 0 in case m is equal to (i, A), (i, 0 ), ( 0 , i) 
and (0, 0) respectively. Each of the first three cases occurs twice among the ten 
even theta characteristics (and the irrelevant last case four times). Summing up 
these contributions gives again the vanishing order at least (a + \c\)/2g that we 
claimed. □ 


12.3. Disjointness of Wd from the reducible locus (via theta products). In 

this subsection we give a proof of Theorem 12.2 based on a completely different idea, 
by establishing a formula for the restriction of theta derivatives to the reducible 
locus. Let 

eo,{z) = Y.q-^/\ e.,{z)= Y. 

nGZ neZ+1/2 

riGZ n&+l/2 

(Here 9ii should perhaps be called 9'i i, since the corresponding Jacobi Thetanull- 
2 2 2 2 

wert vanishes identically, but with these notations it will be easier to write a closed 
expression.) The product formulas found by Jacobi for these four functions, which 
in a modern notation say that they are equal to r]{2z)^/g{z)‘^g{4:z)‘^, 2r]{Az)‘^/g{2z), 
r]{z)‘^/r](2z) and r]{z)^ , respectively, show that none of these functions vanish any¬ 
where in the upper half-plane. Recall from Proposition 5.1 that the reducible locus 
is the union of irreducible curves Pd.v = Fm{v) for v = , where D = + iN 

with N G N. Therefore Theorem 12.2 follows from the following theorem, which 
for simplicity we formulate only for D odd, the case of even D being similar. 

Theorem 12.3. Let D = 1 (mod 4) be a fundamental discriminant. Then for any 
odd theta characteristic (m, m') the restriction of the modular form D29(^rn,m') to 
the curve Pd.v for v as above has the factorization 
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as a product of Jacobi theta functions, where rhi and m '2 are defined by 

rhi = mi + m 2 mod (1), rh'^ = m'l + m^ mod (1) if r = 1 mod 4 
rhi = mi, m 2 = m 2 , if r = 3 mod 4. 

for m = (mi,m 2 ), m' = In particular, this restriction vanishes only at 

cusps. 


Proof. As above we use a tilde to denote elements of the shifted lattice, i.e., Xi = 
Xi + mi. The restriction of the theta derivative to Pd v is 


D2e 


'20(r. 


')(’■ 


(xi,X2)€Z^ 


where p(xi,X 2 ) = xiP X 2 . With the Z-linear transformation yi = xi + ^^ 0:2 
y 2 = —X 2 we obtain 


and 


j. _ ^ 

p{xi,X2) = yi-\ -r- y2 = yi + y'Dv‘"y2 =■■ T{yi,y2) 


D 2 d{m,m'){i^z,v'^ z) = Y e(j/i, 2 / 2 )r (j/i, 1/2 , 

(yi,y2)eZ2+(mi,m2) 


where 

elfyi,y2) = (_l)2(yi-Ai)"t'i+2(y2-™2)rfi2 

and where we have used tr(i2T(i/i, 1 / 2 )^) = Di + Ny 2 - The g-exponent is invariant 
under both yi i-A —yi and y 2 >—>■ —^ 2 - Under, say, y 2 >—>■ —1/2 the sign of e(yi,y 2 ) is 
unchanged unless m 2 = m^ = 1/2. Hence, unless m 2 = m^ = 1/2, the ^/lJv'^y 2 - 
contribution of T{yi,y 2 ) cancels and one checks the formula by multiplying the 
unary theta functions. For an odd theta constant (m, m') precisely one of the cases 
mi = ifi'i = 1/2 or m 2 = m '2 = 1/2 happens and a similar cancellation gives the 
formula in the second case, too. Notice that in each case of the theorem, one of 
the theta series in the decomposition is the function 6^^ of weight 3/2, so that the 
total weight is always 2. □ 


Open problem: Can one reprove the irreducibility, stated in Theorem 5.2 and 
proved by McMullen using combinatorial number theory of the set of cusps, exclu¬ 
sively with techniques of (Hilbert) modular forms? 


13. Applications 


13.1. The modular embedding via theta functions. Using the description of 
IUd as vanishing locus of DO, we can now give the modular embedding cp as in the 
“Fourier expansion” as defined in (2) for any cusp of Wd- In fact most of this was 
already achieved in the proof of Theorem 10.1. There we identified a cusp of Wd 
with a standard quadratic form [a, b, c] (and hence a multiminimizer a) together 
with an element in r G Z/gZ where g = gcd(a, c) (equivalently, a solution S of (90)). 
Recall from this proof that for each such solution there is a unique branch, given 
by (85), of the locus D20(m,m'),a{zi, Z 2 ) = 0. The map z !->■ {z, p{z)) describing this 
branch was given by 


ip{z) 


21. + C + JM. 

a ziri 


( 102 ) 
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We can now describe the arithmetic properties of this expansion. 

Theorem 13.1. The coefficients of the modular embedding describing the branch 
determined by the quadratic form [a, b, c] and number S as in (90) through the cusp 
a of Xj) have the following properties. 

i) The constantC in (102) belongs to \ogK^. In fact, G K\Q. 

a) The number S lies in . 

Hi) For each (3 completing the multiminimizer a determined by [a, b, c] to a basis of 
(a^)^ there exists A G <C* such that in the local parameter Q = Aq of the cusp 

e{iyz + ir^^iz)) = Qtr(a.)ga(.)e(«) v G . 

The scalar A is transcendental of Gelfond-Schneider type, more precisely of the 
form with x and y in K \ Q. 

iv) The coefficients a„ of the power series e{q) — X]ri>i expanded in a local 

parameter Q as in ii) lie in the number field K{S). 

Proof. Statement i) obviously follows from (90) and (86). For ii) it suffices to test 
for iz the dual basis {a*, (3*} C of {a, (3}. Plugging in (3* confirms that S here 
is the same as in (86), since j3* = /N{off^/15. Plugging in a* = f3'^ /NVD 
implies that 

A = e( - 

ViV(a)2/D/ 

The last statement hi) follows since the coefficients of Fourier expansion of the 
theta function lie in K{S) and solving recursively for the an involves only these 
coefficients and integral powers of S. For concreteness, we perform the first step in 
this procedure in the case |c| < a. The summands {xi,X 2 ) = (±^,±^) contribute 
to the lowest order term (in Q) of the restriction of the theta derivative to the 
branch {z,(p{z). The next lowest order term is determined by {xi,X 2 ) = (± 5 ,±|) 
and we can solve for 

^ c,a+2bs pUh |) + §)) 

i)(cr(a*)J^(i, i) + CT(/3*)G(i, i)) 

using the notation introduced in the lines before (83), where F and G given 
in (89) and g = sa + tc. □ 

We remark that the “A” of §7.1 is not quite the same as the one above, and 
that the numerical value given in (49) is not of the form x"^ with x, y G K, but an 
algebraic multiple of this. This is because we normalized our Q in §7.1 so that the 
expansion oit = Q + --- had leading coefficient 1. If we changed Q by the algebraic 
factor, then t and y would still have Q-expansions with coefficients in K, so that 
this would be an equally good choice in the special case D = 17, but for the general 
statement it seems best to normalize Q as above. 


13.2. Fourier coefficients of twisted modular forms. Fix D, a fractional Od- 
ideal a, and a branch of the vanishing locus of T)0 through the cusp at oo of the 
Hilbert modular surface X£,,a given by the quadratic form [a,b,c] and S as above. 
Let F C SL(a''©a) be the subgroup stabilizing this branch. It is the Fuchsian group 
uniformizing the curve Wd, normalized so that the cusp labeled with ([a, b, c], S) is 
the cusp at oo. 
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The first twisted modular form for F we encountered was the form ^p' {z) of bi¬ 
weight (2, —2) in the case D = \fYj. By the preceding theorem we know that this 
modular form has a Fourier expansion of the form 

n>0 

with A transcendental of Gelfond-Schneider type and in the field (= K in 

this case, since g = 1). We will show that such a statement holds for all twisted 
modular forms. 

This section is inspired by work of Wolfart in the case of non-compact Fuch- 
sian triangle groups. Let A{oo, q,r) be such a group. We normalize it so that 
A(oo, q, r) C SL(2, Q) and that oo is a cusp of A(oo, q, r). In this situation Wolfart 
shows in [41]) that there exists some A G C such that the space of (ordinary, i.e. 
of bi-weight (fc, 0)) modular forms for A(oo, q,r) admits a basis given by forms fm 
with Fourier expansions 

fmiz) = rm,n{Aq)'^ with r^.n rational, 

n>0 

where q = e{z/ao). The constant A is transcendental of Gelfond-Schneider type 
if the Fuchsian group is non-arithmetic and is algebraic in the finitely many other 
cases. 

The uniformizing group of the curves Wd is non-arithmetic and the following 
result extends Wolfart’s non-algebraicity result to this class of curves. 

Theorem 13.2. The space of twisted modular form of all bi-weights for the uni¬ 
formizing group F of Wo has a basis of forms with Fourier expansions X]n>o 
with Gn G K{S) with Q = Ae{z/a) and S as in Theorem 13.1 ii). The num¬ 
ber A, and also the radius of convergence \A\ of this series, is transcendental of 
Gelfond-Schneider type. 

Wolfart’s proof of A being of Gelfond-Schneider type relies on properties of F- 
functions and trigonometric calculations. It overlaps with our result in the few 
cases {D = 5,8,12) where the uniformizing group of Wo is a triangle group. The 
rationality of r„ in Wolfart’s result follows easily from the rationality of the coeffi¬ 
cients of the Picard-Fuchs differential operators. This rationality does not hold for 
general D, as can be seen from our example (41). The statement of the theorem 
can indeed not be strengthened to rationality of the coefficients, even in the case 
k = £ = 0, since the modular function t in (46) admits no rescaling that has rational 
coefficients. 


Proof. For each (fc, i) the space of Hilbert modular forms of weight {k, £) has a 
basis of forms whose Fourier expansions have rational coefficients. Suppose that 
/ = Ci,e{iyzi-\-iy'^Z 2 ) is such a basis element. Then the restriction to (zi, 2 : 2 ) = 

(z, (p{z)) with (p as in (102) is 


/(?) = 




Qt''(a!y)gCr(!y)£(g) 


and so for this twisted modular form the claim directly follows from Theorem 13.1. 

For k and £ sufficiently large the restriction map is surjective since £1 0 £2 is 
ample on A^. Here £1 and £2 are the natural line bundles on such that Hilbert 
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modular forms of weight {k,tj are sections of ® (The surjectivity in fact 
holds already for fc > 4 and £ > 10. To see this, we tensor the structure sequence 
for Wd C Xd with and note that the cokernel of the restriction map lies 

in H^{Xd, Owd 0^1 Since 3wd — Oad(—ITd) = and since Li^L 2 

is ample on Xjj, Kodaira’s vanishing theorem implies the claim.) 

For the remaining cases, note that with / and / also 1// and fg have Fourier 
expansions as claimed in the theorem. It thus suffices to consider the products of 
modular forms of bi-weight (fc, £) with a given modular form of large enough weight 
and then to apply the restriction argument. □ 


13.3. The foliation by constant absolute periods. As quotients of H x H, 
Hilbert modular surfaces come with two natural holomorphic foliations, which 
we called Ti and 3^2 ■ They also admit an interesting foliation defined using the 
SL(2, IR)-action on the space HM 2 . The leaves of this foliation are upper half¬ 
planes, but the foliation is not holomorphic. It is studied in detail in [27]. In the 
context of studying the SL(2, K.)-action a natural local coordinate system on HM 2 
is given by period coordinates, i.e. by integrating the holomorphic one-form oj along 
a chosen basis of the first homology relative to the zeros of w. In the special case 
of genus two, we have used this coordinate system in (95). 

We may identify the (Torelli preimage of a) Hilbert modular surface as a subset of 
Pf2M2 = (HM 2 )/C* by mapping X to the class of (A, w), where oj is the eigenform 
for the first embedding of K (in the order that we have chosen once and for all). It 
follows from [2] or [27] that the foliation by constant absolute periods, where only 
xe defined in (95) is allowed to vary, is the first foliation Ti. The function is 
not globally well-defined: its sign depends on the choice of an orientation and it 
may also be altered by a constant by adding the period of a closed loop. However 
q = ((IxeY i® S' well-defined quadratic differential on each leaf of Ti, independent 
of these choices. 

The horizontal trajectories of this quadratic differential exhibit beautiful struc¬ 
tures on the leaves of Ti. They have been determined by McMullen in [29], using 
the following theorem. 


Theorem 13.3. The quadratic dijferential q is proportional to the restriction of 
the meromorphic modular form 


Q{Z1,Z2) 


n ^29 (m,m ')(^1A2) 

,m') odd 



)(^l,^2) 


of weight (—2,4) to the leaf where zi is constant. 


Proof. This follows directly from [2], Theorem 10.2, and Theorem 9.1. More pre¬ 
cisely, Bainbridge has determined the dependence on zi. Since the quadratic differ¬ 
ential depends on the choice of a holomorphic one-form on each Riemann surface, 
there exists a linear map Qi from the first eigenform bundle to quadratic differ¬ 
entials on the leaves of Ti that is locally defined hy q = {dxE)‘^. Such a map is 
the same object as a meromorphic modular form of weight (—2,4). This modu¬ 
lar form Qi vanishes at Wd, where the zeros collide, and acquires a pole at the 
reducible locus Pd, where the zeros are infinitely far apart. It is shown moreover 
in [2], Theorem 10.2, that both the vanishing order and the pole order are equal to 
one. Consequently, by Theorem 9.1 and the fact that the even theta characteristics 
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vanish precisely at the reducible locus, QjQi is a holomorphic function on Xd- 
It then extends holomorphically to the Baily-Borel compactification by Hartog’s 
theorem, since the boundary has codimension two. We deduce that QjQi has to 
be constant, which proves the claim. □ 
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